Recombinant canine brain natriuretic peptide

ABSTRACT

Peptides of the formula R 1  -Cys-Phe-Gly-Arg-Arg-Leu-Asp-Arg-Ile-Gly-Ser-Leu-Ser-Gly-Leu-Gly-Cys-R.sup.2 
     wherein R 1  is selected from the group consisting of: 
     (H); 
     Gly-; 
     Ser-Gly-; 
     Lys-Ser-Gly-; 
     His-Lys-Ser-Gly-; 
     Met-His-Lys-Ser-Gly-; 
     Thr-Met-His-Lys-Ser-Gly-; 
     Lys-Thr-Met-His-Lys-Ser-Gly-; 
     Pro-Lys-Thr-Met-His-Lys-Ser-Gly-; 
     Ser-Pro-Lys-Thr-Met-His-Lys-Ser-Gly-; 
     or is the amino acid sequence of the dog prepro sequence upstream of position 100 shown in FIG. 8 herein or a C-terminal portion thereof; 
     R 2  is (OH), NH 2 , or NR 2  wherein each R is independently H or lower alkyl (1-4C) or is 
     Asn; 
     Asn-Val; 
     Asn-Val-Leu; 
     Asn-Val-Leu-Arg; 
     Asn-Val-Leu-Arg-Lys; 
     Asn-Val-Leu-Arg-Lys-Tyr; 
     or the amides thereof are useful in treating conditions characterized by high levels of extracellular fluid.

CROSS-REFERENCE TO RELATED APPLICATION

This is a division of U.S. Ser. No. 07/477,226 filed Feb. 8, 1990, now U.S. Pat. No. 5,674,710 which is a divisional application of U.S. Ser. No. 07/299,880 filed Jan. 19, 1989, now abandoned, which is a continuation-in-part of U.S. Ser. No. 07/206,470 filed Jun. 14, 1988, now abandoned, which is a continuation-in-part of U.S. Ser. No. 07/200,383 filed May 31, 1988 and now abandoned. Also related is U.S. Ser. No. 07/460,855, now U.S. Pat. No. 5,114,923, issued May 19, 1992, which is a continuation-in-part of U.S. Ser. No. 07/299,880 listed above.

TECHNICAL FIELD

The invention relates generally to natriuretic and homologous peptides found in brain and cardiac tissue. More particularly, it relates to the gene encoding a natriuretic peptide obtained from porcine brain and genes encoding peptides related by amino acid sequence in other species.

BACKGROUND ART

The existence of peptides in the atrium which are responsible for maintenance of normal extracellular fluid parameters--i.e., the volume and pressure of liquid in the blood vessels--is well known. A series of closely related peptides, designated atrial natriuretic peptides, have been isolated from several species and identified, and analogs of these peptides have been prepared.

The natriuretic effect of a crude extract of rat atrial tissue was demonstrated over seven years ago. A number of peptides with diuretic and natriuretic properties have since been isolated from atrial tissue and sequenced: Flynn, T. G., et al., Biochem Biophys Res Commun (1983) 117:859-865; Currie, M. G., et al., Science (1984) 223:67-69; Kangawa, K., et al., Biochem Biophys Res Commun (1984) 118:131-139; U.S. Pat. No. 4,496,544; U.S. Pat. No. 4,508,712; Kangawa, K., et al., Biochem Biophys Res Commun (1984) 119:933-940; Garcia, R., et al., Biochem Biophys Res Commun (1985) 126:178-174; Katsube, N., et al., Biochem Biophys Res Commun (1985) 128:325-330; U.S. Pat. Nos. 4,607,023; 4,557,864; and 4,618,600; copending applications 616,488; 766,030; and 870,795. These peptides, called atrial natriuretic peptides (ANPs), are cyclic disulfides comprising 17 amino acids in the cycle (including the two cysteines which provide the disulfide bond). The gene which encodes them encodes a much longer protein which is then processed into shorter versions which make up the set of ANPs.

Various analogs of the isolated atrial peptides are also described in copending applications 921,360; 138,893; and 174,739.

It is understood that these peptides and their analogs are effective in regulating blood pressure by controlling fluid volume and vessel diameter. A number of disease states are characterized by abnormal fluid retention, including congestive heart failure, cirrhosis of the liver, and nephrotic syndrome. These diseases are associated with excessive fluid accumulation on the venous side of circulation, and an underperfusion of the kidneys, leading to a fall in glomerular filtration rate (GFR). In addition, reduced renal perfusion stimulates secretion of renin, a proteolytic enzyme whose which, in the circulation, leads to the formation of angiotensin, a powerful constrictor of the arteriole. Renin also stimulates release of the sodium-retaining hormone aldosterone by the adrenal gland.

Hypertension per se is another serious result of an increase in extracellular fluid volume and is a major cause of death.

Therapeutic measures related to diseases associated with sodium and water retention are varied and include administration of a variety of diuretic substances. However, no single therapeutic agent is satisfactory for all individuals, and it is important to enhance the repertoire of available materials. The present invention provides additional materials which, besides their supplementation of the repertoire of useful therapeutics, are important in that they are found in with brain and atrium and thus may shed light on the central and peripheral mechanisms whereby normal individuals maintain the appropriate fluid balance. In addition, some of these peptides and proteins have modified and altered physiological activities.

One of these factors from porcine brain has been isolated and sequenced by Sudoh, P., Nature (1988) 332:78-81. It is a 26-amino acid peptide synthesized in porcine brain and atrial tissue at about 1/100 of the concentration of analyzed atrial natriuretic peptide (ANP) activity. The spectrum of activity of this porcine brain natriuretic peptide, or pBNP, is similar to that of the porcine ANP. A comparison of the amino acid sequences (SEQ ID NOS: 1-3) of a portion of human ANP (hANP) and the pBNP is shown below; the corresponding relevant portion of the porcine ANP is identical to the human sequence.

                                         102                                                                    hANP    Arg-Ser-Ser-Cys-Phe-Gly-Gly-Arg-Met-A                                  sp-Arg-Ile-                                                                    pBNP    Asp-Ser-Gly-Cys-Phe-Gly-Arg-Arg-Leu-A                                  sp-Arg-Ile-                                                                             *1      *               *       *                                            126                                                                     Gly-Ala-Gln-Ser-Gly-Leu-Gly-Cys-Asn-Ser-Phe--                                  ---Arg-Tyr                                                                     Gly-Ser-Leu-Ser-Gly-Leu-Gly-Cys-Asn-Val-Leu-A                                  rg-Arg-Tyr                                                                          *   *                           *   *                                             26                                   

There are nine (starred) positions which are not homologous. The conservative substitution of Leu for Ile or Met, found in rat or human ANP sequences, respectively, is a known acceptable substitution.

Subsequent papers from this same group at Miyazaki Medical College further characterize these proteins. Sudoh, T., et al., Biochem Biophys Res Comm (1988) 155:726-732, report the isolation of a 32-amino acid natriuretic peptide ("BNP-32") from porcine brain which contains the 26 amino acids of the porcine BNP described above at its C-terminus and an additional N-terminal 6-amino acid extended portion of the sequence (SEQ ID NO:4) Ser-Pro-Lys-Thr-Met-Arg-. In papers following on subsequent pages, levels of various natriuretic peptides in tissues are reported. Ueda, S., et al., (ibid.), pp. 733-739, utilized a radioimmunoassay to localize and measure the levels of porcine BNP and porcine BNP-32 in the brain and spinal cord. The results showed that both BNP and BNP-32 were major forms of immunoreactive BNP in the porcine brain, and that the highest concentrations were found in the medulla-pons, striatum, and spinal cord. The porcine form of atrial natriuretic peptide (pANP) was also found in the porcine brain but at a level approximately 13 times lower than that characteristic of BNP. Minamino, N., et al. (ibid.), pp. 740-746, report the results of radioimmunoassay for porcine BNP and ANP in peripheral tissue. The concentration of BNP was highest in cardiac atrium of the tissues assayed. The immunoreactive form of this protein was characterized as mostly a 12 kd high molecular weight form; less than 15% of the total immunoreactive BNP in atrial tissue is of the lower molecular weight forms pBNP or pBNP-32.

In a subsequent issue of this publication, Minamino, N., et al., Biochem Biophys Res Comm (1988) 157:402-409, reported the isolation and characterization of this higher molecular weight form of BNP from porcine heart. The complete amino acid sequence of this protein was obtained and shown to contain the 26-amino acid pBNP (and 32-amino acid pBNP-32) at its carboxy terminus. The full-length protein contains 106 amino acids. Finally, Maekawa, K., et al. (ibid.), pp. 410-416, report the cloning and sequence analysis of a cDNA encoding a precursor protein for porcine BNP. A cDNA library was obtained from porcine cardiac atrium and the relevant BNP-encoding gene was isolated and sequenced. The gene was found to include a 25-residue putative signal peptide at the N-terminus followed by the codons corresponding to the 106 amino acids of the reported protein.

These results are consistent with the information available from studies of the atrial-derived natriuretic peptides which are generally also associated with longer precursors. In the parent application herein, the gene encoding porcine BNP was provided, which permitted the putative amino acid sequence of the upstream portion of these precursor proteins to be deduced. While the cDNA obtained in the parent application was incompletely processed and contained an intron, further manipulation of this sequence using standard techniques as described below permitted the location of the intron to be established. Furthermore, the availability of the cDNA encoding pBNP permitted, with considerable effort and ingenuity as shown below, retrieval of genes encoding proteins of similar amino acid sequences from human and dog genomic libraries. Accordingly, the invention provides access to a family of natriuretic peptides (NPs) and natriuretic-related peptides (NRPs) from a variety of vertebrate sources.

DISCLOSURE OF THE INVENTION

The invention provides the complete gene sequence for pBNP and the prepro form thereof and thus the ability to synthesize large amounts of the proteins encoded by this gene and modified forms thereof. The invention also enables retrieval of the gene sequences encoding proteins of similar amino acid sequence having natriuretic activity from other vertebrate species, and thus provides the ability to synthesize them as well. The cDNA encoding the porcine BNP and its precursors and perhaps shorter associated brain proteins is shown in FIG. 1; the segment of this "unprocessed" cDNA which encodes the 26 amino acid pBNP described by Sudoh (supra) is underlined.

Accordingly, in one aspect, the invention is directed to a recombinant cDNA probe containing the sequence encoding the 26-amino acid natriuretic peptide of porcine brain, which comprises the DNA of FIG. 1 or an effective portion thereof. The invention is also directed to recombinant DNA sequences retrieved using this probe, or probes derived from it, and thus includes alternatively useful probes which comprise effective portions of the coding sequences for peptides from canine and human sources shown in FIGS. 5 and 7.

In another aspect, the invention is directed to peptides having natriuretic activity of the formula (SEQ ID NO: 5):

    Asp-Arg-Ile--Phe-Gly-Arg- Arg/ (1)                                                                 Lys   Met                                                  Ser-Gly-Leu-Gly-Cys-R.sup.2                                                       Ser        Ser                                                         

wherein R¹ (SEQ ID NO: 6-11) is selected from the group consisting of:

                                                (H);                                                                          Gly-;                                                                      Ser-Gly-;                                                                Asp/                                          Ser-Gly-;                        Lys/                                                                           Gly                                                                     Arg/   Asp/                                          Ser-Gly-;                 His/                                                                           Gln    Gly                                                                     Arg/   Asp/                                          Ser-Gly-;                                                                                         Val    Gln    Gly                                                                     Arg/   Asp/                                          Ser-Gly-;    Thr/                                                                           Met   Val    Gln    Gly                                                                     Arg/   Asp/                                          Ser-Gly-;ys- Thr/                                                                           Met   Val    Gln    Gly                                                                     Arg/   Asp/                                          Ser-Gly-;ys- Thr/                                                                           Met   Val    Gln    Gly                                                                     Arg/   Asp/                                          Ser-Gly-;ys- Thr/                                                                           Met   Val    Gln    Gly                                      

or a 10- to 109-amino acid sequence shown as the native upstream sequence for porcine, canine or human BNP in FIG. 8, or a composite thereof;

R² (SEQ ID NOS: 12-14) is (OH), NH₂, or NR'R" wherein R' and R" are independently lower alkyl (1-4C) or are

    Asn/                                                                           Lys                                                                            Val/                                                                           Lys                                                                            Val-Leu                                                                        Lys                                                                            Val-Leu-Arg                                                                    Lys                                                                            Val-Leu-Arg- Arg/                                                              Lys                Lys                                                         Tyr/Leu-Arg- Arg/                                                              Lys                Lys   His                                                   or the amides (NH.sub.2  or NR'R") thereof,                                    with the proviso that if formula (1) (SEQ ID NO: 15) is                        R.sup.1 -Cys-Phe-Gly-Arg-Arg-Leu-Asp-Arg-                                         Ile-Gly-Ser-Leu-Ser-Gly-Leu-Gly-Cys-R.sup.2                                 and R.sup.1  is Asp-Ser-Gly-,                                                  R.sup.2  cannot be Asn-Val-Leu-Arg-Arg-                                        Tyr (SEQ ID NO: 16).                                                      

In other aspects, the invention is related to recombinant DNA sequences encoding the foregoing peptides and to recombinant expression systems capable of production of these peptides in suitably transformed hosts. The invention is also related to methods to produce the peptides of the invention using recombinant means by culturing the transformed cells and recovering the desired peptide from the cell cultures.

The invention is also directed to modified forms of this class of peptides wherein 1 or 2 of the positions contain conservative amino acid substitutions.

The invention also relates to pharmaceutical compositions and methods of treatment using the peptides of the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 (SEQ ID NOS: 17-30) shows the complete sequence of a retrieved cDNA in unprocessed form which encodes porcine BNP. The portion of the sequence which encodes the 26-amino acid pBNP peptide is underlined and consists of residues 660-723 and 1276-1289 inclusive.

FIG. 2 (SEQ ID NOS: 31-37) shows oligonucleotides synthesized as probes for pBNP-encoding cDNA.

FIG. 3 (SEQ ID NOS: 38-39) shows the cDNA of FIG. 1 with the location of the additional intron established.

FIG. 4 (SEQ ID NOS: 40-41) shows the coding portions of the pBNP-encoding cDNA absent the introns.

FIG. 5 (SEQ ID NOS: 42-43) shows the DNA and deduced protein sequence for the coding portions of the gene encoding a canine protein with natriuretic activity.

FIG. 6 shows Southern blots of human genomic DNA probed with human ANP (left panel) and with canine NRP (right panel).

FIG. 7 (SEQ ID NOS: 44-45) shows the DNA and deduced amino acid sequence of the human genomic clone encoding the human NRP.

FIG. 8 (SEQ ID NOS: 46-48) shows a comparison of the amino acid sequences of the prepro forms of the porcine, canine and human proteins of the invention.

MODES OF CARRYING OUT THE INVENTION

A. Definitions

As used herein, "brain natriuretic peptide (BNP)" refers to an amino acid sequence which is encoded by a DNA capable of hybridizing to an effective portion of the DNA shown in FIG. 1 under defined stringency conditions, and which has natriuretic activity. It is believed that the brains of all vertebrates contain a subpopulation of peptides with this activity which comprise peptides analogous to that disclosed herein as pBNP and longer precursor proteins containing this amino acid sequence, as well as active fragments thereof.

As used herein, "porcine brain natriuretic peptide (pBNP)" refers to the 26 amino acid sequence isolated by Sudoh et al., and set forth hereinabove. "pBNP-encoding cDNA" refers to the nucleotide sequence shown in FIG. 1 herein, comprising residues 660-723 and 1276-1289 inclusive. The separation in the cDNA of the pBNP codons is presumably due to incomplete processing of the mRNA which formed the template for this particular clone. This clone was deposited at the American Type Culture Collection, Rockville, Md., on Jun. 10, 1988 and has accession number ATCC 40465.

The pBNP-encoding CDNA shown in FIG. 1, because it contains additional sequences encoding precursor proteins, and, as explained below, presumably contains nucleotides corresponding to an additional intron besides that represented by the sequence separating the pBNP-encoding portion per se, can be used as an effective probe to obtain either genomic or cDNA sequences encoding corresponding associated brain natriuretic peptides in various vertebrate species. "Precursor brain natriuretic peptide" as used in the present application refers to peptides with natriuretic activity encoded by the gene sequence from which, for example, the pBNP protein is derived but processed so as to obtain peptides of different length. Similar processing differences presumably exist in other vertebrates as well; the entire class of such natriuretic peptides is retrieved by the DNA probe of the invention. For example, examination of the reading frame of the pBNP-encoding DNA shows an N-terminal extension so that N-terminally extended peptides can be postulated. It has been shown that the ANP precursor "pro-ANP" is processed differently in atrial and brain tissues leading to different ANP peptides. By analogy to the peptides found in the atrium, it is postulated that an important peripheral form of BNP would be the 29-residue peptide of pBNP N-terminally extended with the tripeptide Thr-Met-Arg. Further N-terminal extended peptides with the additional upstream residues Ser-Pro-Lys and Gly-Ile-Arg-Ser-Pro-Lys are also expected. Thus, examination of the reading frame which contains the pBNP also permits postulation of additional upstream processing sites which would extend the N-terminal sequence further.

Other extended precursor peptides are discoverable through standard techniques using the sequence information of FIG. 1. It is clear, by analogy with atrial natriuretic peptide precursors, that the start of the longest precursor, perhaps including a signal sequence, is at the methionine codon shown in the upper-most reading frame in the line spanning nucleotide 61 and 120, or at the closely positioned downstream ATG. Therefore, it is clear that the reading frame is not maintained from this start of translation into the pBNP-encoding region. This indicates that there is at least one other intron transcribed into the cDNA clone retrieved. The location of this intron and deduction of the full sequence for the longest form of precursor peptide is described in further detail below. In any event, precursor BNP peptides associated with pBNP include other natriuretic peptides encoded by this depicted gene; analogous groups of peptides are collectively designated natriuretic "NP" peptides in other species.

Additional terminology which is useful is the term "prepro" NP, which refers to the encoded peptide having both the native associated signal sequence which effects secretion of the various forms of the peptide with natriuretic activity and an amino acid sequence of the secreted peptide which is fused upstream of the cyclic portion absolutely required for this activity. The "pro" form having the upstream sequence may represent the circulating form of the peptide. With respect to the three specific embodiments included within the present invention, which are shown in detail in FIGS. 3, 5, and 7 for porcine, canine and human proteins, respectively, the location of the putative signal sequences representing the "pre" sequence is shown in each figure, as well as the full-length mature protein, which is thought to be a precursor form designated the "pro" form. Because various processing sites are available, as indicated by the upward-pointing arrows in these figures and in the composite sequences shown in FIG. 8, attempts to make a fine-line and definite distinction between the "pro" NP and "NP" are probably meaningless. The peptides defined by the invention are set forth in formula (1) above and have natriuretic activity, regardless of the length of the N-terminal form preceding the identified 26-amino acid regions corresponding to the porcine "BNP" of Sudoh, or attached to the cyclic portion thereof.

"Expression system" refers to a DNA which contains a coding region operably linked to suitable control sequences capable of effecting its expression in a compatible host. Expression systems invariably comprise a promoter, but, depending on the host intended, may contain additional critical DNA such as ribosome binding site or CAP site, termination sequence, and optional enhancer sequences upstream from the promoter or in other operable locations. The recombinant expression systems of the invention herein comprise a DNA of the invention encoding a BNP, for example, a BNP derived from a vertebrate source, operably linked to additional DNA sequences which are capable of effecting its expression. The expression system may reside on a transfer vector such as a plasmid or a viral vector which is self-replicating independently of the chromosome of the host cell, or may be constructed so that when inserted into a host cell it is able to integrate into the chromosome.

B. Other Associated Porcine BNPs and Retrieval of Vertebrate NRP Genes

The invention, in one aspect, is directed to all members of the group of porcine BNP proteins encoded in the cDNA shown in FIG. 1, and to conservative modifications thereof. The deduction of the amino acid sequence encoding the longest precursor protein, and therefore deduction of the processed forms, can be accomplished using the unprocessed cDNA here provided. In this standard approach, oligonucleotide sequences representing short portions of the cDNA spanning the potential intron--i.e., between residues 100 to about 660--are synthesized, labeled, and used to probe Northern blots of mRNA isolated from cells producing BNP. Most mRNAs will be in processed form; hence, those oligonucleotides which successfully hybridize to the proper length message represent coding regions of the cDNA. Those which do not readily hybridize represent intron regions. By using overlapping synthetic cDNAs, the intron position can be precisely identified. This permits deduction of the complete sequence encoding the largest precursor protein, and defines the sequence from which the associated BNP proteins are formed.

In a modification of this approach, partial cDNA fragments were generated in amplified form from mRNA isolated from porcine atrium. The cDNA for amplification was obtained by hybridization of poly A⁺ RNA isolated from this tissue with the oligonucleotide 3895. Amplification was performed using a polymerase chain reaction wherein the oligonucleotide primers corresponded to bases 100-123 (identity strand) and 652-685 (complementary strand) as shown in FIG. 1. Two bands are obtained when the amplified products are analyzed on preparative agarose gels; the larger band in low relative abundance presumably represents the smaller DNA derived from the unspliced precursor, and the more prominent band is assumed to be the more fully processed cDNA. When this band was eluted from the gel and sequenced, the stretch corresponding to bases 223-468 of FIG. 1 was not present, and the recovered DNA had the sequence shown in FIG. 3.

Thus, using standard techniques, the location of the putative upstream intron, which would correspond to that found in atrial natriuretic peptide precursors, as described by Greenberg et al., Nature (1984) 312:656-658, was easily obtained. As shown in FIG. 4, which represents solely the portions shown as coding sequences of FIG. 3, a reading frame of 131 amino acids is obtained. It is believed that the signal sequence is represented by amino acids 1-25, and that the cleavage site converting the prepro form of porcine BNP to the pro form is between Ser₂₅ and His at position 26 of the prepro sequence, as shown in FIG. 4. The sequence of the porcine BNP reported by Sudoh having 26 amino acids is represented by amino acid residues 81-106 (106-131).

In addition to providing access to the class of porcine BNPs encoded on the retrieved cDNA, the cDNA of FIG. 1 provides access to the corresponding precursor gene encoding the class of associated NRP proteins from various vertebrate species.

The pBNP-encoding cDNA shown in FIG. 1, or an effective portion thereof can be used as a probe in gene libraries obtained from other vertebrate hosts, by a number of procedures generally known in the art. The source of the desired genes can be either a genomic library appropriate to the vertebrate species or a cDNA library from cells synthesizing the peptide. As explained below, although these peptides are synthesized in brain, they are also known to occur in atrial tissue at a lower level than the normally produced ANP. Therefore, the more readily accessible atrial tissue can also be used to prepare the cDNA library to be probed in preference to brain tissue. In this instance, high stringency and +/- hybridization can be used to distinguish the more predominant ANP-encoding DNA. Preparation of both genomic and cDNA libraries is well known in art; indeed, some genomic libraries are commercially available. Preferred techniques for preparing cDNA libraries are disclosed by Hyunh, V. T., et al., DNA Cloning Techniques--A Practical Approach (IRL Press, Oxford, 1984), and by Okayama and Berg, Mol Cell Biol (1983) 3:280-289. Preferably, the procedure exemplified below can, for example, be followed.

The genomic or cDNA library is then probed under nonstringent conditions (e.g., 20% formamide, 6× SSC at 37° C.) to obtain hybridizing sequences. The retrieved sequences can then be analyzed and sequenced according to standard procedures.

The entire pBNP-encoding DNA of FIG. 1 can be used as a probe, or an effective portion can be used. What constitutes an effective portion depends on the nature of the library being probed and can be determined experimentally. In general, if a genomic library is the source for retrieval of the desired gene, a segment of the pBNP-encoding cDNA extending from about residue 601-1300 is convenient. This segment bridges the intron which interrupts the coding sequence for the pBNP protein. Upstream portions of the sequence could also be used. Of course, there is no particular disadvantage in using the entire clone. On the other hand, if cDNA libraries are being investigated, it may be desirable to use only a portion of the cDNA which represents the ordinarily spliced coding regions. Thus, for example, a convenient probe might be a contiguous DNA sequence representing the pBNP codons (positions 660-723 and 1276-1289) or a somewhat smaller segment thereof.

It is understood, of course, that the actual probes may be sequences shown, or preferably their complements.

In practice, the porcine DNA of FIG. 1 was able to retrieve genes encoding related proteins in genomic libraries from a variety of other species, either directly or indirectly. Genomic libraries from pig, rat, dog, cat and rabbit showed the ability to hybridize to the probe of FIG. 1 under at least one of the conditions:

(1) 50% formamide, 6× SSC, 5× Denhardt's, 10 mM sodium phosphate, 10 ug/ml sheared DNA at 42° C.; and

(2) 20% formamide, 6× SSC, 5× Denhardt's, 10 mM sodium phosphate, 10 ug/ml sheared DNA at 37° C.

Under both hybridization conditions, washing was at 1× SSC, 0.1% SDS at 50-60° C. for 1 hour.

Human genomic DNA did not hybridize to the DNA sequence of FIG. 1 under these conditions, but could be obtained indirectly using the other mammalian DNAs obtained using this probe.

By obtaining the canine DNA through use of the porcine probe, an insert having the sequence shown in FIG. 5 was obtained, was designated pdBNP-1, and was deposited at the American Type Culture Collection on Dec. 14, 1988 under Accession No. ATCC-67862. Using this sequence as a probe, a clone obtained from EcoRI-digested human genomic DNA was found which encodes a similar protein having natriuretic activity. This human DNA has the sequence shown in FIG. 7, was designated phBNP-1, and was deposited at the American Type Culture Collection on Dec. 14, 1988 under Accession No. ATCC-67863.

The amino acid sequence encoding the putative prepro forms of peptides with natriuretic activity from porcine, canine, and human species are shown in FIG. 8. It is apparent that the porcine and canine species are more homologous in the region putatively responsible for natriuretic activity than the human sequence. Using the information in FIG. 8, a class of peptides having natriuretic activity can be defined. This class is of the formula (SEQ ID NO: 5).

    R.sup.1 --Cys--Phe--Gly--Arg--Arg/ --  Leu/ --Asp--Arg--Ile--                                                 (1)                                                                Lys     Met                                                    Gly/ --Ser-- Leu/ --Ser--Gly--Leu--Gly--Cys--R.sup.2                           Ser        Ser                                                         

wherein R¹ (SEQ ID NOS: 6-11) is selected from the group consisting of:

                                                (H);                                                                          Gly--;                                                                     Ser--Gly--;                                                              Asp/                                                                           Lys/ --Ser--Gly--;                                                             Gly                                                                     Arg/   Asp/                                                                    His/ -- Lys/ --Ser--Gly--;                                                     Gln    Gly                                                                     Arg/   Asp/                                                             Met/ -- His/ -- Lys/ --Ser--Gly--;                                             Val    Gln    Gly                                                                     Arg/   Asp/                                                       Thr/ --Met/ -- His/ -- Lys/ --Ser--Gly--;                                      Met   Val    Gln    Gly                                                                     Arg/   Asp/                                                  Lys-- Thr/ --Met/ -- His/ -- Lys/ --Ser--Gly--;                                     Met   Val    Gln    Gly                                                                     Arg/   Asp/                                              Pro--Lys-- Thr/ --Met/ -- His/ -- Lys/ --Ser--Gly--;                                    Met   Val    Gln    Gly                                                                     Arg/   Asp/                                          Ser--Pro--Lys-- Thr/ --Met/ -- His/ -- Lys/ --Ser--Gly--;                                   Met   Val    Gln    Gly                                      

or a 10- to 109-amino acid sequence shown as the native upstream sequence for porcine, canine or human BNP in FIG. 8, or a composite thereof;

R² (SEQ ID NOS: 12-14) is (OH), NH₂, or NR'R" wherein R' and R" are independently lower alkyl (1-4C) or isAsn/LysAsn/ --ValLysAsn/ --Val--LeuLysAsn/ --Val--Leu--ArgLysAsn/ --Val--Leu--Arg-- Arg/Lys LysAsn/ --Val--Leu--Arg-- Arg/ --Tyr/Lys Lys Hisor the amides (NH₂ or NR'R") thereof,with the proviso that if formula (1) (SEQ ID NO: 15) isR¹ --Cys--Phe--Gly--Arg--Arg--Leu--Asp--Arg-- Ile--Gly--Ser--Leu--Ser--Gly--Leu--Gly--Cys--R²and R¹ is Asp--Ser--Gly--, R² cannot beAsn--Val--Leu--Arg--Arg--Tyr (SEQ ID NO: 16).

As used above, "composite" of the sequences shown as native upstream sequences for porcine, canine or human BNP in FIG. 8 refers to upstream sequences as there shown, where each position contains the alternative amino acids shown for all three species interchangeably. The composites of these sequences are constructed as were the composites for the specifically designated sequences in formula (1) and definitions of R¹ and R² above.

In addition, these peptide sequences can be modified by substituting one or two conservative amino acid substitutions for the positions specified, including substitutions which utilize the D rather than L form. As these peptides can be synthesized using standard solid-phase techniques, for example, it is not necessary to confine the conservative substitutions to amino acids encoded by genes.

Upon retrieval of the precursor gene encoding the BNP family for a particular vertebrate species, deduction of the BNP peptides associated with that species is a matter of translation of the determined sequence and identification of the processing site. Guidance is given by virtue of the pattern with respect to the atrial natriuretic peptide counterparts. It is believed that in analogy to the atrial peptides, the BNP proteins are cyclic disulfides formed by oxidation of the cysteine residues at positions 4 and 20 of the sequence shown above for pBNP. The class of BNP peptides encoded for a particular species is believed to include truncated forms exocyclic to this disulfide bonded ring as well as extended forms of the PBNP shown, including peptides with one or two conservative amino acids substitutions in the sequence.

A deduced (or otherwise generated) peptide sequence falls within the scope of certain natriuretic proteins of the invention, provided that the DNA encoding it directly or indirectly hybridizes to the pBNP-encoding cDNA of FIG. 1 under conditions corresponding to the stringency represented by hybridization in buffer containing 20% formamide, 5× Denhardt's, 6× SSC, 100 mg/ml RNA, and 0.05% sodium pyrophosphate at 42° C., followed by washing at 60° C. at 1× SSC, 0.1% SDS, or under conditions (1) or (2) described above. In addition, the peptide encoded by this DNA must exhibit natriuretic activity assayed as described below.

By "direct hybridization" is meant that the DNA encoding the natriuretic peptide hybridizes to the DNA shown in FIG. 1 or an effective portion thereof per se. By "indirect hybridization" is meant that the DNA hybridizes to an DNA which is capable itself of hybridizing to the porcine BNP of FIG. 1. Thus, the human sequence shown in FIG. 7 indirectly hybridizes to the porcine BNP through the canine sequence of FIG. 5.

The invention is also directed to modified forms of the BNP proteins encoded by the cDNA of FIG. 1. One or two of the positions of these BNPs can be altered, so long as activity is retained. Conservative amino acid substitutions are preferred--that is, for example, aspartic/glutamic as acidic amino acids; lysine/arginine/histidine as basic amino acids; leucine/isoleucine, methionine/valine as hydrophobic amino acids; serine/glycine/alanine/threonine as hydrophilic amino acids. However, as the peptides need not be prepared by recombinant methods or from the gene, the substitutions may include nonencoded amino acids such as the D- or beta-amino forms.

B. Production of BNP

The BNP protein of the invention can be produced in a variety of ways, including using recombinant methods.

The retrieved genes encoding BNP peptides can then be manipulated and expressed using a variety of recombinant systems. Peptides having the sequence encoded by any subsegment of the retrieved gene can be obtained in host systems which do not process the translated protein by proper design of the expression system. For example, the expression system is constructed by placing an ATG start codon immediately preceding the desired N-terminus and a termination codon after the desired C-terminus, with appropriate modification of any adjacent sequences. The desired coding sequence is then ligated in operable linkage to a control system functional in procaryotic or eucaryotic hosts, as desired. A large number of control systems are now known in the art.

As the natriuretic peptide precursors are evidently processed in certain eucaryotic systems, attention should be paid to the choice of the recombinant host, or it is possible to prevent processing by modification of the gene sequence so as to encode substitute amino acids in positions believed to be susceptible to cleavage by proteolytic enzymes. For example, the arginine immediately upstream from the aspartic acid residue at position one of pBNP could be replaced by a threonine residue, thus rendering the resulting peptide non-susceptible to trypsin cleavage at that site. In the alternative, expression can be effected in hosts which are deficient in enzymes capable of processing these peptides.

As the genes encoding the families of natriuretic-related peptides for various vertebrate species are made accessible by the availability of the probes constructed from pBNP-encoding DNA, these genes can be manipulated by replacing the codons for one or more amino acids by site directed mutagenesis, to obtain sequences encoding analogs of these peptides which retain natriuretic activity.

Construction of expression vectors and recombinant production from the appropriate DNA sequences are performed by methods known in the art per se.

Expression can be in procaryotic or eucaryotic systems. Procaryotes most frequently are represented by various strains of E. coli. However, other microbial strains may also be used, such as bacilli, for example Bacillus subtilis, various species of Pseudomonas, or other bacterial strains. In such procaryotic systems, plasmid vectors which contain replication sites and control sequences derived from a species compatible with the host are used. For example, E. coli is typically transformed using derivatives of pBR322, a plasmid derived from an E. coli species by Bolivar et al., Gene (1977) 2:95. Commonly used procaryotic control sequences, which are defined herein to include promoters for transcription initiation, optionally with an operator, along with ribosome binding site sequences, including such commonly used promoters as the beta-lactamase (penicillinase) and lactose (lac) promoter systems (Chang et al., Nature (1977) 198:1056) and the tryptophan (trp) promoter system (Goeddel et al., Nucleic Acids Res (1980) 8:4057) and the lambda-derived P_(L) promoter and N-gene ribosome binding site (Shimatake et al., Nature (1981) 292:128). Any available promoter system compatible with procaryotes can be used.

The expression systems useful in the eucaryotic hosts comprise promoters derived from appropriate eucaryotic genes. A class of promoters useful in yeast, for example, include promoters for synthesis of glycolytic enzymes, including those for 3-phosphoglycerate kinase (Hitzeman et al., J Biol Chem (1980) 255:2073). Other promoters include those from the enolase gene (Holland, M. J., et al. J Biol Chem (1981) 256:1385) or the Leu2 gene obtained from YEp13 (Broach, J., et al., Gene (1978) 8:121).

Suitable mammalian promoters include metallothionein, the early and late promoters from SV40 (Fiers et al., Nature (1978) 273:113) or other viral promoters such as those derived from polyoma, adenovirus II, bovine papilloma virus or avian sarcoma viruses. Suitable viral and mammalian enhancers are cited above. In the event plant cells are used as an expression system, the nopaline synthesis promoter is appropriate (Depicker, A., et al., J Mol Appl Gen (1982) 1:561).

The expression system is constructed from the foregoing control elements operably linked to the BNP sequences using standard methods, employing standard ligation and restriction techniques which are well understood in the art. Isolated plasmids, DNA sequences, or synthesized oligonucleotides are cleaved, tailored, and religated in the form desired.

Site specific DNA cleavage is performed by treating with the suitable restriction enzyme (or enzymes) under conditions which are generally understood in the art, and the particulars of which are specified by the manufacturer or these commercially available restriction enzymes. See, e.g., New England Biolabs, Product Catalog. In general, about 1 ug of plasmid or DNA sequence is cleaved by one unit of enzyme in about 20 ul of buffer solution; in the examples herein, typically, an excess of restriction enzyme is used to insure complete digestion of the DNA substrate. Incubation times of about 1 hr to 2 hr at about 37° C. are workable, although variations can be tolerated. After each incubation, protein is removed by extraction with phenol/chloroform, and may be followed by ether extraction, and the nucleic acid recovered from aqueous fractions by precipitation with ethanol followed by running over a Sephadex G-50 spin column. If desired, size separation of the cleaved fragments may be performed by polyacrylamide gel or agarose gel electrophoresis using standard techniques. A general description of size separation is found in Methods in Enzymology (1980) 65:499-560.

Restriction cleaved fragments may be blunt ended by treating with the large fragment of E. coli DNA polymerase I (Klenow) in the presence of the four deoxynucleotide triphosphates (dNTPs) using incubation times of about 15 to 25 min at 20 to 25° C. in 50 mM Tris pH 7.6, 50 mM NaCl, 6 mM MgCl₂, 6 mM DTT and 5-10 uM dNTPs. The Klenow fragment fills in a 5' sticky ends but chews back protruding 3' single strands, even though the four dNTPs, are present. If desired, selective repair can be performed by supplying only one of the, or selected, dNTPs within the limitations dictated by the nature of the sticky ends. After treatment with Klenow, the mixture is extracted with phenol/chloroform and ethanol precipitated followed by running over a Sephadex G-50 spin column. Treatment under appropriate conditions with S1 nuclease results in hydrolysis of any single-stranded portion.

Synthetic oligonucleotides are prepared using commercially available automated oligonucleotide synthesizers. Kinasing of single strands prior to annealing or for labeling is achieved using an excess, e.g., approximately 10 units of polynucleotide kinase to 0.1 nmole substrate in the presence of 50 mM Tris, pH 7.6, 10 mM MgCl₂, 5 mM dithiothreitol, 1-2 mM ATP, 1.7 pmoles ³² P-ATP (2.9 mCi/mmole), 0.1 mM spermidine, 0.1 mM EDTA.

Ligations are performed in 15-30 ul volumes under the following standard conditions and temperatures: 20 mM Tris-HCl pH 7.5, 10 mM MgCl₂, 10 mM DTT, 33 ug/ml BSA, 10 mM-50 mM NaCl, and either 40 uM ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0° C. (for "sticky end" ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 14° C. (for "blunt end" ligation). Intermolecular "sticky end" ligations are usually performed at 33-100 ug/ml total DNA concentrations (5-100 nM total end concentration). Intermolecular blunt end ligations (usually employing a 10- to 30-fold molar excess of linkers) are performed at 1 uM total ends concentration.

In vector construction employing "vector fragments," the vector fragment is commonly treated with bacterial alkaline phosphatase (BAP) in order to remove the 5' phosphate and prevent religation of the vector. BAP digestions are conducted at pH 8 in approximately 150 mM Tris, in the presence of Na⁺ and Mg⁺² using about 1 unit of BAP per ug of vector at 60° C. for about 1 hr. In order to recover the nucleic acid fragments, the preparation is extracted with phenol/chloroform and ethanol precipitated and desalted by application to a Sephadex G-50 spin column. Alternatively, religation can be prevented in vectors which have been double digested by additional restriction enzyme digestion of the unwanted fragments.

For portions of vectors derived from cDNA or genomic DNA which require sequence modifications, site specific primer directed mutagenesis is used. This is conducted using a primer synthetic oligonucleotide complementary to a single stranded phage DNA to be mutagenized except for limited mismatching, representing the desired mutation. Briefly, the synthetic oligonucleotide is used as a primer to direct synthesis of a stand complementary to the phage, an the resulting double-stranded DNA is transformed into a phage-supporting host bacterium. Cultures of the transformed bacteria are plated in top agar, permitting plaque formation from single cells which harbor the phage.

Theoretically, 50% of the new plaques will contain the phage having, as a single strand, the mutated form; 50% will have the original sequence. The resulting plaques are hybridized with kinased synthetic primer at a temperature which permits hybridization of an exact match, but at which the mismatches with the original strand are sufficient to prevent hybridization. Plaques which hybridize with the probe are then picked, cultured, and the DNA recovered. Details of site specific mutation procedures are described below in specific examples.

Correct ligations for plasmid construction can be confirmed by first transforming E. coli strain MM294 obtained from E. coli Genetic Stock Center, CGSC #6135, or other suitable host with the ligation mixture. Successful transformants are selected by ampicillin, tetracycline or other antibiotic resistance or using other markers depending on the mode of plasmid construction, as is understood in the art. Plasmid from the transformants are then prepared according to the method of Clewell, D. B., et al., Proc Natl Acad Sci USA (1969) 62:1159, optionally following chloramphenicol amplification (Clewell, D. B., J Bacteriol (1972) 110:667). The isolated DNA is analyzed by restriction and/or sequenced by the dideoxy method of Sanger, F., et al., Proc Natl Acad Sci USA (1977) 74:5463 as further described by Messing et al., Nucleic Acids Res (1981) 9:309, or by the method of Maxam et al., Methods in Enzymology (1980) 65:499.

The constructed vector is then transformed into a suitable host.

Depending on the host cell used, transformation is done using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described by Cohen, S. N., Proc Natl Acad Sci USA (1972) 69:2110, or the RbCl method described in Maniatis et al., Molecular Cloning: A Laboratory Manual (1982), Cold Spring Harbor Press, p. 254, is used for procaryotes or other cells which contain substantial cell wall barriers. Infection with Agrobacterium tumefaciens (Shaw, C. H., et al., Gene (1983) 23:315) is used for certain plant cells. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology (1978) 52:546, is preferred. Transformations into yeast are carried out according to the method of Van Solingen, P., et al., J Bacter (1977) 130:946, and Hsiao, C. L., et al., Proc Natl Acad Sci USA (1979) 76:3829.

The transformed cells are then cultured under conditions favoring expression of the BNP sequence and the recombinantly produced protein recovered from the culture.

In addition to recombinant production, peptides whose deduced sequences are sufficiently short to make direct peptide synthesis practical can be prepared using standard solid-phase techniques.

Thus, compounds within the scope of the present invention can be synthesized chemically by means well known in the art such as, e.g., solid-phase peptide synthesis. The synthesis is commenced from the carboxy-terminal end of the peptide using an alpha-amino protected amino acid. t-Butyloxycarbonyl (Boc) protective groups can be used for all amino groups even though other protective groups are suitable. For example, Boc-Val-OH, Boc-Leu-OH, Boc-Arg-OH or Boc-Tyr-OH (i.e., selected BNP analog carboxy-terminal amino acids) can be esterified to chloromethylated polystyrene resin supports. The poly-styrene resin support is preferably a copolymer of styrene with about 0.5 to 2% divinyl benzene as a cross-linking agent which causes the polystyrene polymer to be completely insoluble in certain organic solvents. See Stewart, et al., Solid-Phase Peptide Synthesis (1969), W. H. Freeman Co., San Francisco, and Merrifield, J Am Chem Soc (1963) 85:2149-2154. These and other methods of peptide synthesis are also exemplified by U.S. Pat. Nos. 3,862,925; 3,842,067; 3,972,859; and 4,105,602.

The synthesis may use manual techniques or automatically employing, for example, an Applied Biosystems 430A Peptide Synthesizer (Foster City, Calif.) or a Biosearch SAM II automatic peptide synthesizer (Biosearch, Inc., San Rafael, Calif.), following the instructions provided in the instruction manual supplied by the manufacturer.

Of course, since automated synthesis also permits control of the sequence, the above-mentioned modifications to the amino acid sequence obtained by modifying the gene as described above are available using this method of synthesis. In addition, it is not necessary that the substituted amino acid be encoded by a gene. Therefore, the D-forms or beta-amino acids can be substituted for those natively present.

The foregoing methods to synthesize the BNP of the invention are not intended to be limiting, and the BNP of the invention may be prepared in any convenient manner. The BNP is required only to be encoded by a gene which hybridizes under the above-specified stringent conditions to the cDNA of FIG. 1, and to show natriuretic activity in the receptor assay described below.

C. Assay Systems

The members of the natriuretic peptides of the invention from the various vertebrate species and the modifications thereof can be verified to have the required natriuretic activity using standard methods to assay such activity. A number of systems both in vitro and in vivo are available. The simplest form of in vitro test is a binding assay testing the affinity of the peptides for receptors in the kidney and other sites responsible for influencing the clearance of the endogenous natriuretic compounds. Accordingly, in a manner analogous to the assay procedure for the atrial derived natriuretic peptides, natriuretic activity in general can be assayed by the ability of the candidate peptide to compete with pBNP which has been labeled, for example, by iodination for binding to receptors from cultured bovine aortic smooth muscle (BASM) cells and bovine aortic endothelial (BAE) cells. The competition is diagnostic for the binding to the relevant clearance receptors. In addition, levels of cyclic GMP can be measured in these same cells and are diagnostic for binding of the peptide to relevant biological receptors responsible for the observed in vivo bioactivity.

To fall within the scope of the compositions claimed herein, the candidate peptide must be encoded by a gene sequence capable of hybridizing directly or indirectly to pBNP-encoding DNA under the stringency conditions set forth herein, or be defined by formula (1) or its modified forms as set forth above, and must show activity in an in vitro receptor binding assay; either binding to the clearance receptors as shown by the competition assay, or to the effector receptors as shown by the alteration of cyclic GMP levels, or both. The peptide may or may not have direct biological activities associated with pBNP, such as cyclic GMP activity, if it inhibits clearance receptor binding in a manner that correlates with an in vivo test for natriuretic and diuretic activities. The peptides may also be vasodilators.

Receptor Binding Assays

Specific ANP receptor sites have been identified on target tissues, such as kidney, adrenal, blood vessels, and cultured cells. Napier, M. A., et al., Proc Nat Acad Sci USA (1984) 81:5946-5940; DeLean, A., et al., Endocrinology (1984) 115:1636-1638; Schenk, D. B., et al., Biochem Biophys Res Comm (1985) 127:433-442. Such tissues will have receptors for BNP binding which may or may not be identical to those for ANP. Since the binding of ANP or ANP analogs to these specific receptor sites is presumptively a prerequisite of biological activity, binding of BNP-associated peptides, or their modified forms to these receptors is considered predictive of biological activity.

An assay has been developed, generally in accordance with the disclosure of Schenk, supra, and Scarborough, R. M., et al., J Biol Chem (1986) 261:12960-12964, which evaluates the ability of ANP analogs to compete with a labeled native ANP for binding to cultured BASM and BAE cells. A similar assay, utilizing labeled pBNP can be used to evaluate candidate BNP family peptides. The pBNP (shown above) was iodinated on the carboxy-terminal Y residue and is identified as (¹²⁵ I)-pBNP. Analogous "competitive displacement" receptor binding assays are considered commonplace in the art for examining specific ligand-receptor interactions.

In this assay, 0.5 nM (¹²⁵ I)-pBNP or (¹²⁵ I)-human NRP is incubated in each individual sample of BASM cells in the presence of varying amounts of unlabeled pBNP or a candidate peptide encoded by a gene hybridizing to pBNP-encoding cDNA.

Increasing concentrations of pBNP, or successful candidate peptide effectively prevent (¹²⁵ I)-pBNP binding to BASM cell-associated receptors. The concentration of unlabeled peptide at which 50% of maximal (¹²⁵ I)-pBNP binding is displaced is called Ki(app) and reflects receptor-binding affinity. Therefore, a peptide with a Ki(app)=100 nM displays substantially weaker interaction with a receptor than peptide with a Ki(app)=10 nM. Assuming these BNP analogs act at one or more BNP receptor sites, then increased receptor affinity should reflect increased biological potency.

The choice of the proper natriuretic peptide for the competition assay above should be made with regard to the peptide of the invention being tested. The receptors utilized by, for example, ANP and BNP may be the same or different. Alternate species forms of either may be used as competitors in suitable assay reactions for candidate peptides of formula (1).

Whole Mammal Bioassays

The biological activity of NP sequences of the present invention (which show activity in the receptor assay above), can be confirmed in anesthetized rats and dogs. The correlation of receptor binding affinity and in vivo effects demonstrates the predictive value of the receptor assays for biological activity.

1. Diuresis and Natriuresis in Anesthetized Rats

In one method, cannulae are placed in the left and right ureters and femoral vein of anesthetized rats and urine is collected from the ureters. NP compositions are administered via the femoral vein. Prior to infusing the NP, saline is infused for 30 minutes, urine is collected for 6 five-minute baseline periods and urine volume is determined gravimetrically.

Following these baseline collection periods, various NPs are infused for 30 or 60 minutes and urine volume is measured in five-minute periods during infusion and for 60 minutes following infusion (at which time rats are returned to saline). Data are examined by averaging urine flow rates for six five-minute baseline control periods immediately preceding infusion, and comparing values during and after administration of NP with the "baseline" control values. Responses to NP are thus evaluated and plotted as the percent of baseline control responses. Responses to peptides that are substantially above baseline±SD can thus be interpreted as being statistically significant increases.

2. Diuresis and Natriuresis in Anesthetized Dogs

The biological activity of NP of the present invention can also be confirmed in pentobarbital-anesthetized dogs. In these examples, cannulae are placed in the left and right ureters or urinary bladder and femoral vein of anesthetized dogs and urine is collected. NP is administered via the femoral vein. Prior to infusing NP, saline is infused for 30 minutes and urine is then collected for three ten-minute collection periods. Urine volume is determined gravimetrically and urine sodium is determined photometrically.

Following these three baseline collection periods, the selected NP is infused for 60 minutes and urinary flow and urinary sodium excretion measured for an additional 60 minutes following infusion. During infusion (60 minutes) and recovery (60 minutes), ten-minute collection periods are obtained. Control animals which received only saline are studied in parallel.

Data are examined by comparing urine flow rates and sodium excretion rates for dogs infused with various NP portions against control animals infused with saline.

Isolated Tissue Bioassays

The effect of NP in vivo may be achieved solely by the ability to potentiate the effect of endogenous NP, through blockage of the receptors involved in binding and clearing endogenous NPs. To the extent that this is the case for a particular NP, it could be expected that the diuretic and natriuretic effects of the NP would be diminished or eliminated in isolated tissue where NPs are not present unless specifically supplied.

Thus, NP compositions fall within the scope of the invention even if their activity in isolated tissue bioassays is low; however, activity in these assays may also be present.

D. Utility and Administration

Briefly, the natriuretic peptides of the invention are useful in treatment of disorders associated with high levels of extracellular fluids such as hypertension. The compounds are administered in conventional formulations for peptides such as those described in Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa. (latest edition). Preferably, the peptides are administered by injection, preferably intravenously, using appropriate formulations for this route of administration. Dosage levels are on the order of 0.01-100 ug/kg of subject.

These compounds, and compositions containing them, can find use as therapeutic agents in the treatment of various edematous states such as, for example, congestive heart failure, nephrotic syndrome and hepatic cirrhosis, in addition to hypertension and renal failure due to ineffective renal perfusion or reduced glomerular filtration rate. The natriuretic peptides of the invention are particularly effective in the treatment of congestive heart failure.

Thus the present invention also provides compositions containing an effective amount of compounds of the present invention, including the nontoxic addition salts, amides and esters thereof, which may, alone, serve to provide the above-recited therapeutic benefits. Such compositions can also be provided together with physiologically tolerable liquid, gel or solid diluents, adjuvants and excipients.

These compounds and compositions can be administered to mammals for veterinary use, such as with domestic animals, and clinical use in humans in a manner similar to other therapeutic agents. In general, the dosage required for therapeutic efficacy will range from about 0.001 to 100 ug/kg, more usually 0.01 to 100 ug/kg of the host body weight. Alternatively, dosages within these ranges can be administered by constant infusion over an extended period of time, usually exceeding 24 hours, until the desired therapeutic benefits have been obtained.

Typically, such compositions are prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be prepared. The preparation may also be emulsified. The active ingredient is often mixed with diluents or excipients which are physiologically tolerable and compatible with the active ingredient. Suitable diluents and excipients are, for example, water, saline, dextrose, glycerol, or the like, and combinations thereof. In addition, if desired the compositions may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, stabilizing or pH-buffering agents, and the like.

The compositions are conventionally administered parenterally, by injection, for example, either subcutaneously or intravenously. Additional formulations which are suitable for other modes of administration include suppositories, intranasal aerosols, and, in some cases, oral formulations. For suppositories, traditional binders and excipients may include, for example, polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10% preferably 1%-2%. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharin, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained-release formulations, or powders, and contain 10%-95% of active ingredient, preferably 25%-70%.

The peptide compounds may be formulated into the compositions as neutral or salt forms. Pharmaceutically acceptable nontoxic salts include the acid addition salts (formed with the free amino groups) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or organic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.

In addition to the compounds of the present invention which display natriuretic, diuretic or vasorelaxant activity, compounds of the present invention can also be employed as intermediates in the synthesis of such useful compounds. Alternatively, by appropriate selection, compounds of the present invention whose activity levels are reduced or eliminated entirely can serve to modulate the activity of other diuretic, natriuretic or vasorelaxant compounds, including compounds outside the scope of the present invention, by, for example, binding to clearance receptors, stimulating receptor turnover, or providing alternate substrates for degradative enzyme or receptor activity and thus inhibiting these enzymes or receptors. When employed in this manner, such compounds can be delivered as admixtures with other active compounds or can be delivered separately, for example, in their own carriers.

Compounds of the present invention can also be used for preparing antisera for use in immunoassays employing labeled reagents, usually antibodies. Conveniently, the polypeptides can be conjugated to an antigenicityconferring carrier, if necessary, by means of dialdehydes, carbodiimide or using commercially available linkers. These compounds and immunologic reagents may be labeled with a variety of labels such as chromophores, fluorophores such as, e.g., fluorescein or rhodamine, radioisotopes such as ¹²⁵ I, ⁵ S, ¹⁴ C, or ³ H, or magnetized particles, by means well known in the art.

These labeled compounds and reagents, or labeled reagents capable of recognizing and specifically binding to them, can find use as, e.g., diagnostic reagents. Samples derived from biological specimens can be assayed for the presence or amount of substances having a common antigenic determinant with compounds of the present invention. In addition, monoclonal antibodies can be prepared by methods known in the art, which antibodies can find therapeutic use, e.g., to neutralize overproduction of immunologically related compounds in vivo.

Suitable subjects include those animals having conditions of high water or sodium ion accumulation. Both veterinary and therapeutic uses in humans are appropriate.

The following examples are intended to illustrate but not to limit the invention.

EXAMPLE 1 Retrieval of Porcine BNP-Encoding DNA

Porcine heart tissue frozen in liquid nitrogen was obtained and separated roughly into atrial and ventricular portions. Frozen atrial tissue (5 g) was first pulverized in a mortar and pestle, then ground to a powder in a tissuemizer and liquefied in a large volume (25 ml) of 5M guanidinium thiocyanate containing 50 mM Tris (pH 7.5), 5 mM EDTA, and 5% beta-mercaptoethanol. Sarcosyl was then added to 2% and the sample was incubated at 65° C. for 2 min whereupon insoluble material was removed by centrifugation at 7000× g for 10 min. Total RNA was isolated from this supernatant by adding 2.5 g of CsCl, layering over a 10 ml cushion of 5.8M CsCl and centrifuging again at 25,000 rpm for 12 hr. The supernatant was subsequently aspirated and the pellet dissolved in buffer containing Tris (50 mM), EDTA (5 mM), and beta-mercaptoethanol (2%). The RNA solution was then phenol extracted once and precipitated with ethanol. Poly A⁺ RNA was isolated by oligo dT cellulose chromatography.

Double-stranded DNA complementary to the porcine atrial mRNA (5 ug) was synthesized by the RNAse H method. The cDNA was then methylated by standard methods and EcoRI linkers ligated and digested. The entire cDNA was then ligated into previously prepared lambda phage arms and packaged according to standard methods. Plating of this packaging reaction gave a library with ˜1.75×10⁵ randomly isolated phage from this library showed that almost all (>95%) had inserts in the 1-5 kb range.

Probes were designed to detect the pBNP cDNA. As shown in FIG. 2, conservation with respect to human ANP cDNA was assumed in constructing oligo 3351. Thus, the human ANP-encoding sequence shown in the figure was modified only to the extent required to obtain a sequence encoding pBNP. The second oligo, 3352, was another 60-mer designed according to mammalian codon preference and preferring G and T over A and C. The additional 60-mer 3376 was synthesized to match the human ANP sequence so as to eliminate false positives.

Approximately 300,000 phage from the above library were plated and lifted with nitrocellulose filters in duplicate. Series A (15 nitrocellulose filters) were denatured, neutralized, baked for 2 hr and prehybridized for 2 hr in hybridization buffer (20% formamide, 5× Denhardt's solution, 6× SSC, 0.05% pyrophosphate, 100 ug/ml salmon sperm DNA). Labeled oligonucleotide probe (3351, 1.5×10⁷ cpm) was then added and the filters incubated overnight at 42° C. Filters were subsequently washed in 6× SSC, 0.1% SDS at 20° C. for 40 min., and then twice in 1× SSC at 65° C. for 10 min. Series B was treated in the same manner except oligo 3352 was used. Final washing was at 60° C. in this case. In both cases filters were dried and subjected to autoradiography. Approximately 450 positives (0.2% of total clones plated) were obtained when probing with oligo 3351 and most of these are believed to be porcine ANP based on previous screening with this oligo. Four clones in series B hybridized to oligo 3352. These hybrids were stable at 60° C. but not at 65° C. Of these four, only clone 14 hybridized with oligo 3351 also, and this was picked and subjected to another round of purification. The purified phage was then grown, and the DNA was isolated by centrifugation at 36,000 rpm over a CsCl step gradient, phenol extracted, dialyzed, and ethanol precipitated. The phage DNA contained a 1.5 kb DNA insert when subjected to restriction analysis with EcoRI. This insert was then subcloned into an M13 sequencing vector and the sequence determined. The abundance of this BNP mRNA appeared about 400-fold lower than ANP in this library, or 0.0005%.

The DNA sequence of the insert from clone 14 is shown in FIG. 1. The coding region for BNP is present within the clone; however, it is interrupted by what appears to be an intron at residue Val₂₂ of the 26-amino acid BNP. Therefore, it appears that this clone contains an unprocessed mRNA with one or more introns present.

EXAMPLE 2 Identification of the Upstream Intron

As set forth above, the DNA sequence of FIG. 1 shows a change in reading frame and, furthermore, by analogy to the ANP-encoding gene as described by Greenberg et al. (supra), may contain an upstream intron. In order to locate the position of this intron, the sequences surrounding its putative location were used as primers in an amplification procedure to obtain spliced DNA lacking the intron.

Poly A⁺ RNAs were isolated from porcine atrial tissue using the guanidinium isothiocyanate method of Chirgwin, J. M., Biochemistry (1979) 18:5294-5299, followed by oligo-dT cellulose chromatography. Approximately 2 ug of the porcine atrial mRNA was incubated with 400 ng of oligonucleotide 3895 (supra) as primer in a 20 ul reaction containing 0.5 mM dNTPs, 50 mM Tris-HCl, pH 8.3, 10 mM magnesium chloride, 10 units of RNasin, and 50 units of reverse transcriptase. Subsequent amplification of the resulting DNA was performed as described by Saiki, R. K., Science (1988) 239:487-491. After incubation for 1 hr at 37° C., half of the reaction was diluted to 100 ul in 67 mM Tris-HCl, pH 8.8, 6.7 mM magnesium chloride, 16.6 mM ammonium sulfate, 10 mM mercaptoethanol, 6.7 uM EDTA, 1 mM dNTPs, 10% DMSO, and 400 ng of each primer oligonucleotide. The oligonucleotide primers were those corresponding to bases 100-123 (identity strand) and 652-685 (complementary strand) of the pBNP clone shown in FIG. 1.

The reaction mixture was denatured by boiling for 5 min followed by incubation at 42° C. to allow primer annealing. Thermus aguaticus polymerase (3 units) was added and the sample further incubated at 72° C. for 3 min. The cycle (98° C., 1 min; 43° C., 1 min; 72° C., 3 min) was repeated 30 times without addition of additional polymerase. The extended reaction was conducted under a 100 ul layer of mineral oil, and a 10 ul aliquot was removed and analyzed by standard agarose gel electrophoresis. The resulting DNA fragments were visualized after staining with ethidium bromide and purified by preparative agarose gel electrophoresis. The amplified DNA fragment was then kinased, ligated into M13, and sequenced.

Two DNA sequences resulted from this reaction: one of approximate 650 bp was in low relative amount and was presumably the unspliced version; the more abundant, approximately 350 bp, band was assumed to be the fully processed DNA and, indeed, showed the sequence set forth in FIG. 4 between the reverse arrows at positions 100-684.

EXAMPLE 3 Retrieval of Additional BNP Encoding Genes

The entire 1504-base sequence in FIG. 1, or a shorter segment constituting bases 601-1300 is then used as a probe to obtain the genes encoding the analogous BNP peptides and other vertebrate species. In this approach, blots of genomic DNA are used as substrate. Approximately 10 ug of genomic DNA from the liver of the appropriate species is digested with BamHI or PstI overnight, and the digested DNA precipitated with ethanol and electrophoresed on 0.8% agarose gels. The gels are blotted onto nitrocellulose filters overnight, and the filters then de-natured, baked and prehybridized at 42° C. in prehybridization buffer (20% formamide, 5× Denhardt's 6× SSC, 100 mg/ml RNA, 0.05% sodium pyrophosphate).

The cDNA is labeled by nick-translation and hybridized to two panels from the same gel at 42° C. overnight in the prehybridization buffer. The filters are then washed in 1× SSC, 0.1% SDS at 60° C. and 65° C. and exposed to autoradiographic film.

The genes encoding the analogous BNPs in the particular species are then amplified and sequenced according to standard techniques. The deduced sequences can be manipulated to provide suitable restriction sites for insertion into expression systems and to provide desired stop and start codons by site-directed mutagenesis.

EXAMPLE 4 Alternative Method to Obtain Natriuretic Peptide-Encoding DNAs

In a modification of the method set forth in Example 3, both the canine and human genomic libraries yielded DNAs encoding natriuretic peptides.

Genomic DNA from pig, rat, dog, cat, rabbit and human organisms were probed on Southern blots using the cDNA illustrated in FIG. 1 herein under two different hybridization conditions:

(1) 50% formamide, 6× SSC, 5× Denhardt's, 10 mM sodium phosphate, 10 ug/ml sheared DNA at 42° C.; and

(2) 20% formamide, 6× SSC, 5× Denhardt's, 10 mM sodium phosphate, 10 ug/ml sheared DNA at 37° C.

Washing in both cases was in 1× SSC, 0.1% SDS at 50-60° C. for 1 hr.

A dog genomic library obtained from Clontech Inc. yielded 2 clones under the condition (1) above and the DNA from these identified clones was digested with HaeIII or AluI and subcloned into M13. The resulting plaques were screened for hybridization to the porcine probe, and positive clones were sequenced. The identity of the clone was confirmed by detection of the BNP-encoding sequence of FIG. 1, and the 2.9 kb HindIII fragment containing the entire gene was then subcloned into pBR322, and designated pdBNP-1. The DNA sequence of the portion of this clone encoding the BNP gene is shown in FIG. 5, and the pdBNP-1 plasmid was deposited at the American Type Culture Collection on Dec. 14, 1988 and has Accession No. ATCC-67862.

Although a human genomic library failed to yield signals corresponding to hybridization with the probe using the porcine DNA of FIG. 1, use of pdBNP-1 as a probe under condition (1) above produced several distinct bands that could be visualized in blots of digested human genomic DNA, as shown in FIG. 6. A preparative agarose gel was utilized to isolate EcoRI-digested human genomic DNA in the 6-7 kb size range, which isolated DNA was then cloned into lambda-ZAP2 (Strategene Inc.), packaged, and the resulting mini-library was screened using the hybridization condition (1) above. Seven positive signals were purified and the insert subcloned into pBLUSCRIPT vector. The sequences of the M13 subclones of hybridization-positive HaeIII and AluI-digested plasmid DNA were determined. The sequence of the coding region of the plasmid, phBNP-1, is shown in FIG. 7 and the plasmid was deposited at the American Type Culture Collection on Dec. 14, 1988 with Accession No. ATCC-67863.

Using the intron splice junction consensus sequences described by Mount, S., Nucleic Acids Res (1982) 10:459-472, it appears that the first exon of the human cDNA sequence contains two extra amino acids in the BNP precursor region as compared to the porcine sequence. This can be verified by PCR amplification of human atrial RNA.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES: 50                                             - (2) INFORMATION FOR SEQ ID NO:1:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 26 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                  - Gly Ser Gly Cys Phe Gly Arg Lys Met Asp Ar - #g Ile Ser Ser Ser Ser          #                15                                                            - Gly Leu Gly Cys Lys Val Leu Arg Arg His                                      #            25                                                                - (2) INFORMATION FOR SEQ ID NO:2:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 25 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                  - Arg Ser Ser Cys Phe Gly Gly Arg Met Asp Ar - #g Ile Gly Ala Gln Ser          #                15                                                            - Gly Leu Gly Cys Asn Ser Phe Arg Tyr                                          #            25                                                                - (2) INFORMATION FOR SEQ ID NO:3:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 26 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                  - Asp Ser Gly Cys Phe Gly Arg Arg Leu Asp Ar - #g Ile Gly Ser Leu Ser          #                15                                                            - Gly Leu Gly Cys Asn Val Leu Arg Arg Tyr                                      #            25                                                                - (2) INFORMATION FOR SEQ ID NO:4:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                  - Ser Pro Lys Thr Met Arg                                                       1               5                                                             - (2) INFORMATION FOR SEQ ID NO:5:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 17 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is Arg or Lys.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 6                                                      #Amino Acid is Leu or Met.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 10                                                     #Amino Acid is Gly or Ser.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 12                                                     #Amino Acid is Leu or Ser.ATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                  - Cys Phe Gly Arg Xaa Xaa Asp Arg Ile Xaa Se - #r Xaa Ser Gly Leu Gly          #                15                                                            - Cys                                                                          - (2) INFORMATION FOR SEQ ID NO:6:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 4 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 2                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                  - Xaa Xaa Ser Gly                                                               1                                                                             - (2) INFORMATION FOR SEQ ID NO:7:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 5 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is Met or Val.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 2                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 3                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                  - Xaa Xaa Xaa Ser Gly                                                           1               5                                                             - (2) INFORMATION FOR SEQ ID NO:8:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is Thr or Met.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 2                                                      #Amino Acid is Met or Val.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 3                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 4                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                  - Xaa Xaa Xaa Xaa Ser Gly                                                       1               5                                                             - (2) INFORMATION FOR SEQ ID NO:9:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 7 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 2                                                      #Amino Acid is Thr or Met.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 3                                                      #Amino Acid is Met or Val.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 4                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                  - Lys Xaa Xaa Xaa Xaa Ser Gly                                                   1               5                                                             - (2) INFORMATION FOR SEQ ID NO:10:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 8 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 3                                                      #Amino Acid is Thr or Met.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 4                                                      #Amino Acid is Met or Val.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 6                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                 - Pro Lys Xaa Xaa Xaa Xaa Ser Gly                                               1               5                                                             - (2) INFORMATION FOR SEQ ID NO:11:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 9 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 4                                                      #Amino Acid is Thr or Met.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is Met or Val.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 6                                                      #Amino Acid is His, Arg or Gln.:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 7                                                      #Amino Acid is Lys, Asp or Gly.:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                 - Ser Pro Lys Xaa Xaa Xaa Xaa Ser Gly                                           1               5                                                             - (2) INFORMATION FOR SEQ ID NO:12:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 4 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is Asn or Lys.ATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                 - Xaa Val Leu Arg                                                               1                                                                             - (2) INFORMATION FOR SEQ ID NO:13:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 5 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is Asn or Lys.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is Arg or Lys.ATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                 - Xaa Val Leu Arg Xaa                                                           1               5                                                             - (2) INFORMATION FOR SEQ ID NO:14:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (ix) FEATURE:                                                                      (A) NAME/KEY: Other                                                            (B) LOCATION: 1                                                      #Amino Acid is Asn or Lys.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 5                                                      #Amino Acid is Arg or Lys.ATION:                                                         (A) NAME/KEY: Other                                                            (B) LOCATION: 6                                                      #Amino Acid is Tyr or His.ATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                 - Xaa Val Leu Arg Xaa Xaa                                                       1               5                                                             - (2) INFORMATION FOR SEQ ID NO:15:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 17 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                 - Cys Phe Gly Arg Arg Leu Asp Arg Ile Gly Se - #r Leu Ser Gly Leu Gly          #                15                                                            - Cys                                                                          - (2) INFORMATION FOR SEQ ID NO:16:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                 - Asn Val Leu Arg Arg Tyr                                                       1               5                                                             - (2) INFORMATION FOR SEQ ID NO:17:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 1504 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: Coding Se - #quence                                              (B) LOCATION: 100...630                                                        (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                 - AATTCCAGG CTGCTAGGAA GTGAAAAGTG AACCTGGACC CAGCTCAGCG GCA - #GCAGCAG           60                                                                           - CGGCAGCAGG CAGCAGCCTC TATCCTCTCC TCCAGCCAC ATG GGC CCC - # CGG ATG            114                                                                           #       Met Gly Pro Arg Met                                                    #     5  1                                                                     - GCG CTT CCC CGC GTG CTC CTG CTC CTG TTC TT - #G CAC CTG TTG CTG CTA           162                                                                           Ala Leu Pro Arg Val Leu Leu Leu Leu Phe Le - #u His Leu Leu Leu Leu            #                20                                                            - GGA TGC CGT TCC CAT CCA CTG GGT GGC GCT GG - #C CTG GCC TCA GAA CTG           210                                                                           Gly Cys Arg Ser His Pro Leu Gly Gly Ala Gl - #y Leu Ala Ser Glu Leu            #            35                                                                - CCA GGG ATA CAG GTG AGC CCT GAT GAA CTG CT - #T AGA CTT GGT TGG CTG           258                                                                           Pro Gly Ile Gln Val Ser Pro Asp Glu Leu Le - #u Arg Leu Gly Trp Leu            #        50                                                                    - GGA GGG CGC GGA CAG CAG CAA CTA ACG GGT CC - #C CAC CTA CTG TTC CAA           306                                                                           Gly Gly Arg Gly Gln Gln Gln Leu Thr Gly Pr - #o His Leu Leu Phe Gln            #    65                                                                        - GAG GGC TCT AAC CTC CTT TGG GAA CTA GTG AT - #A AGG GGT TTA GAA GGC           354                                                                           Glu Gly Ser Asn Leu Leu Trp Glu Leu Val Il - #e Arg Gly Leu Glu Gly            #85                                                                            - AGC CAG GCT GGG GGT GAG GAC CCG CTC CCA AG - #G CAG TTG GTT CGC TTC           402                                                                           Ser Gln Ala Gly Gly Glu Asp Pro Leu Pro Ar - #g Gln Leu Val Arg Phe            #                100                                                           - AGC ACC ATC AAG AGT GAT GGG TCC AGG TGC GA - #G TTC CTG AGG CTC GGG           450                                                                           Ser Thr Ile Lys Ser Asp Gly Ser Arg Cys Gl - #u Phe Leu Arg Leu Gly            #           115                                                                - CTC CCC CAC CCA TCC CAG GAG CTG CTG GAC CG - #C CTG CGA GAC AGG GTC           498                                                                           Leu Pro His Pro Ser Gln Glu Leu Leu Asp Ar - #g Leu Arg Asp Arg Val            #       130                                                                    - TCC GAG CTG CAG GCG ACG GGA CGG ACC TGG AG - #C CCC TCC GGC AGG ACC           546                                                                           Ser Glu Leu Gln Ala Thr Gly Arg Thr Trp Se - #r Pro Ser Gly Arg Thr            #   145                                                                        - GTG GCC TCA CAG AAG CCT GGG AGG CGA GGG AA - #G CAG CCC CCA CGG GGG           594                                                                           Val Ala Ser Gln Lys Pro Gly Arg Arg Gly Ly - #s Gln Pro Pro Arg Gly            150                 1 - #55                 1 - #60                 1 -        #65                                                                            - TTC TTG GGC CCC GCA GTA GCA TCT TCC AAG TC - #C TCC GGGGAATACG CAGCCC         646                                                                           Phe Leu Gly Pro Ala Val Ala Ser Ser Lys Se - #r Ser                            #               175                                                            - CAAGACGATG CGTGACTCTG GCTGCTTTGG GCGGAGGCTG GACCGGATCG GC - #TCCCTCAG         706                                                                           - CGGCCTGGGC TGCAATGGTG AGCACCCACC CCATTCCCAC TGCACGCCCC GG - #TTAGCATC         766                                                                           - ACTTCTGGGT TTGATGTCTC TGGGACCAAA CTCCGAGAAA AGGACACCTG GA - #TATCACTC         826                                                                           - TTTCTTGTTG CCAGTCCTCA AGGCCAAGGA GCGCCTTCCT GGAAAAATTA AA - #TTTGGACA         886                                                                           - GCATTCACTA GCATGACTAT GAGTCCCCAC CCACCTTCTC GCCACCCCCT GC - #CTCTCTCA         946                                                                           - CCCAAGGCGG CAGAATTACT TTAGGATGTA AATTCTGTCA TTGCCTGGCT GC - #CGCTCCTG        1006                                                                           - GGAGCAAAAA GAGAACTAAA CCTCTTCCCC CTGGTTTCCC CTCAACTGTC TG - #TGGCTGCA        1066                                                                           - AAGGCAGAGG GCAGGATCAC CAGGGTGATG ACAAGTCCCA GCTTACAAGG AG - #GAAACTCA        1126                                                                           - GGTCCAGAGA GATGGATTAT CCCAAAGCCC CAAACATCCA GTTCTGCTGA AG - #AAGGCGGG        1186                                                                           - TGGCAGGGGT GGCACGTGGT GGGGGGAAGC CCAGGTCCTG CCTGCCTCTC AC - #CCTAATGT        1246                                                                           - CATCCTCACC CTCTCTCTCC CCCCCACAGT GCTCAGGAGG TACTGAGAAG TC - #CTGGCTGA        1306                                                                           - CAACCTCTGT GTCCGCTTCT CCAACGCCCC TCCCCTGCTC CCCTTCAAAG CA - #ACTCCTGT        1366                                                                           - TTTTATTTAT GTATTTATTT ATTTATTTAT TTGGTGGTTG TATATAAGAC GG - #TTCTTATT        1426                                                                           - TGTGAGCACA TTTTTTCCAT GGTGAAATAA AGTCAACATT AGAGCTCTGT CT - #TTTGAAAA        1486                                                                           #1504              TC                                                          - (2) INFORMATION FOR SEQ ID NO:18:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 177 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                 - Met Gly Pro Arg Met Ala Leu Pro Arg Val Le - #u Leu Leu Leu Phe Leu          #                15                                                            - His Leu Leu Leu Leu Gly Cys Arg Ser His Pr - #o Leu Gly Gly Ala Gly          #            30                                                                - Leu Ala Ser Glu Leu Pro Gly Ile Gln Val Se - #r Pro Asp Glu Leu Leu          #        45                                                                    - Arg Leu Gly Trp Leu Gly Gly Arg Gly Gln Gl - #n Gln Leu Thr Gly Pro          #    60                                                                        - His Leu Leu Phe Gln Glu Gly Ser Asn Leu Le - #u Trp Glu Leu Val Ile          #80                                                                            - Arg Gly Leu Glu Gly Ser Gln Ala Gly Gly Gl - #u Asp Pro Leu Pro Arg          #                95                                                            - Gln Leu Val Arg Phe Ser Thr Ile Lys Ser As - #p Gly Ser Arg Cys Glu          #           110                                                                - Phe Leu Arg Leu Gly Leu Pro His Pro Ser Gl - #n Glu Leu Leu Asp Arg          #       125                                                                    - Leu Arg Asp Arg Val Ser Glu Leu Gln Ala Th - #r Gly Arg Thr Trp Ser          #   140                                                                        - Pro Ser Gly Arg Thr Val Ala Ser Gln Lys Pr - #o Gly Arg Arg Gly Lys          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Gln Pro Pro Arg Gly Phe Leu Gly Pro Ala Va - #l Ala Ser Ser Lys Ser          #               175                                                            - Ser                                                                          - (2) INFORMATION FOR SEQ ID NO:19:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 13 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                 - Pro Ala Cys Ser Cys Ser Cys Ser Cys Thr Cy - #s Cys Cys                      #                10                                                            - (2) INFORMATION FOR SEQ ID NO:20:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 20 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                 - Asp Ala Val Pro Ile His Trp Val Ala Leu Al - #a Trp Pro Gln Asn Cys          #                 15                                                           - Gln Gly Tyr Arg                                                                           20                                                                - (2) INFORMATION FOR SEQ ID NO:21:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 18 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                 - Ala Leu Met Asn Cys Leu Asp Leu Val Gly Tr - #p Glu Gly Ala Asp Ser          #                15                                                            - Ser Asn                                                                      - (2) INFORMATION FOR SEQ ID NO:22:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 16 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                 - Arg Val Pro Thr Tyr Cys Ser Lys Arg Ala Le - #u Thr Ser Phe Gly Asn          #                15                                                            - (2) INFORMATION FOR SEQ ID NO:23:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 30 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                 - Lys Ala Ala Arg Leu Gly Val Arg Thr Arg Se - #r Gln Gly Ser Trp Phe          #                 15                                                           - Ala Ser Ala Pro Ser Arg Val Met Gly Pro Gl - #y Ala Ser Ser                  #             30                                                               - (2) INFORMATION FOR SEQ ID NO:24:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 56 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                 - Gly Ser Gly Ser Pro Thr His Pro Arg Ser Cy - #s Trp Thr Ala Cys Glu          #                15                                                            - Thr Gly Ser Pro Ser Cys Arg Arg Arg Asp Gl - #y Pro Gly Ala Pro Pro          #            30                                                                - Ala Gly Pro Trp Pro His Arg Ser Leu Gly Gl - #y Glu Gly Ser Ser Pro          #        45                                                                    - His Gly Gly Ser Trp Ala Pro Gln                                              #    55                                                                        - (2) INFORMATION FOR SEQ ID NO:25:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                 - His Leu Pro Ser Pro Pro                                                       1               5                                                             - (2) INFORMATION FOR SEQ ID NO:26:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 36 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                 - Pro Arg Ala Pro Ala Pro Val Leu Ala Pro Va - #l Ala Ala Arg Met Pro          #                15                                                            - Phe Pro Ser Thr Gly Trp Arg Trp Pro Gly Le - #u Arg Thr Ala Arg Asp          #            30                                                                - Thr Gly Glu Pro                                                                      35                                                                     - (2) INFORMATION FOR SEQ ID NO:27:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 23 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                 - Thr Trp Leu Ala Gly Arg Ala Arg Thr Ala Al - #a Thr Asn Gly Ser Pro          #                15                                                            - Pro Thr Val Pro Arg Gly Leu                                                              20                                                                 - (2) INFORMATION FOR SEQ ID NO:28:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 17 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                 - Pro Pro Leu Gly Thr Ser Asp Lys Gly Phe Ar - #g Arg Gln Pro Gly Trp          #                15                                                            - Gly                                                                          - (2) INFORMATION FOR SEQ ID NO:29:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 15 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                 - Gly Pro Ala Pro Lys Ala Val Gly Ser Leu Gl - #n His His Gln Glu              #                15                                                            - (2) INFORMATION FOR SEQ ID NO:30:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 106 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                 - Trp Val Gln Val Arg Val Pro Glu Ala Arg Al - #a Pro Pro Pro Ile Pro          #                15                                                            - Gly Ala Ala Gly Pro Pro Ala Arg Gln Gly Le - #u Arg Ala Ala Gly Asp          #            30                                                                - Gly Thr Asp Leu Glu Pro Leu Arg Gln Asp Ar - #g Gly Leu Thr Glu Ala          #        45                                                                    - Trp Glu Ala Arg Glu Ala Ala Pro Thr Gly Va - #l Leu Gly Pro Arg Ser          #    60                                                                        - Ser Ile Phe Gln Val Leu Arg Gly Ile Arg Se - #r Pro Lys Thr Met Arg          #80                                                                            - Asp Ser Gly Cys Phe Gly Arg Arg Leu Asp Ar - #g Ile Gly Ser Leu Ser          #                95                                                            - Gly Leu Gly Cys Asn Val Leu Arg Arg Tyr                                      #           105                                                                - (2) INFORMATION FOR SEQ ID NO:31:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 60 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: Coding Se - #quence                                              (B) LOCATION: 1...60                                                           (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                 - TCC AGC TGC TTC GGG GGC AGG ATG GAC AGG AT - #T GGA GCC CAG AGC GGA            48                                                                           Ser Ser Cys Phe Gly Gly Arg Met Asp Arg Il - #e Gly Ala Gln Ser Gly            #                15                                                            #       60                                                                     Leu Gly Cys Asn                                                                            20                                                                 - (2) INFORMATION FOR SEQ ID NO:32:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 20 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                            -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                 - Ser Ser Cys Phe Gly Gly Arg Met Asp Arg Il - #e Gly Ala Gln Ser Gly          #                15                                                            - Leu Gly Cys Asn                                                                          20                                                                 - (2) INFORMATION FOR SEQ ID NO:33:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 60 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:                                 - ACNGGNTGCT TGGGNCGNCG NCTNGACCGN ATNGGNTCNC TNTCNGGNCT NG - #GNTGCAAC          60                                                                           - (2) INFORMATION FOR SEQ ID NO:34:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 20 amino                                                           (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:                                 - Ser Gly Cys Phe Gly Arg Arg Leu Asp Arg Il - #e Gly Ser Leu Ser Gly          #                15                                                            - Leu Gly Cys Asn                                                                          20                                                                 - (2) INFORMATION FOR SEQ ID NO:35:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 60 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:                                 - AGGCCGACGA AGCCCGCGTC CGACCTGTCC TAACCTAGGG ACTCGCCTGA CC - #CGACATTG          60                                                                           - (2) INFORMATION FOR SEQ ID NO:36:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 60 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:                                 - TCGCCGACGA AGCCGTCTTC TGAGCTGTCT TAGCCGTCGG AGTCGCCGGA GC - #CGACGTTG          60                                                                           - (2) INFORMATION FOR SEQ ID NO:37:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 60 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:                                 - AGGTCGACGA AGCCCCCGTC CTACCTGTCC TAACCTCGGG TCTCGCCTGA CC - #CGACATTG          60                                                                           - (2) INFORMATION FOR SEQ ID NO:38:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 1507 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 100...222                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 469...723                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 1281...1292                                                      (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:                                 - GAATTCCAGG CTGCTAGGAA GTGAAAAGTG AACCTGGACC CAGCTCAGCG GC - #AGCAGCAG          60                                                                           - CGGCAGCAGG CAGCAGCCTC TATCCTCTCC TCCAGCCACA TGGGCCCCCG GA - #TGGCGCTT         120                                                                           - CCCCGCGTGC TCCTGCTCCT GTTCTTGCAC CTGTTGCTGC TAGGATGCCG TT - #CCCATCCA         180                                                                           - CTGGGTGGCG CTGGCCTGGC CTCAGAACTG CCAGGGATAC AGGTGAGCCC TG - #ATGAACTG         240                                                                           - CTTAGACTTG GTTGGCTGGG AGGGCGCGGA CAGCAGCAAC TAACGGGTCC CC - #ACCTACTG         300                                                                           - TTCCAAGAGG GCTCTAACCT CCTTTGGGAA CTAGTGATAA GGGGTTAGAA GG - #CAGCCAGG         360                                                                           - CTGGGGGTGA GGACCCCGCT CCCAAGGCAG TTGGTTCGCT TCAGCACCAT CA - #AGAGTGAT         420                                                                           - GGGTCCAGGT GCGAGTTCCT GAGGCTCGGG CTCCCCCACC CATCCCAGGA GC - #TGCTGGAC         480                                                                           - CGCCTGCGAG ACAGGGTCTC CGAGCTGCAG GCGGAGCGGA CGGACCTGGA GC - #CCCTCCGG         540                                                                           - CAGGACCGTG GCCTCACAGA AGCCTGGGAG GCGAGGGAAG CAGCCCCCAC GG - #GGGTTCTT         600                                                                           - GGGCCCCGCA GTAGCATCTT CCAAGTCCTC CGGGGAATAC GCAGCCCCAA GA - #CGATGCGT         660                                                                           - GACTCTGGCT GCTTTGGGCG GAGGCTGGAC CGGATCGGCT CCCTCAGCGG CC - #TGGGCTGC         720                                                                           - AATGGTGAGC ACCCACCCCC ATTCCCACTG CACGCCCCGG TTAGCATCAC TT - #CTGGGTTT         780                                                                           - GATGTCTCTG GGGACCAAAC TCCGAGAAAA GGACACCTGG ATATCACTCT TT - #CTTGTTGC         840                                                                           - CAGTCCTCAA GGCCAAGGAG CGCCTTCCTG GAAAAATTAA ATTTGGACAG CA - #TTCACTAG         900                                                                           - CATGACTATG AGTCCCCACC CACCTTCTCG CCACCCCCTG CCTCTCTCAC CC - #AAGGCGGC         960                                                                           - AGAATTACTT TAGGATGTAA ATTCTGTCAT TGCCTGGCTG CCGCTCCTGG GA - #GCAAAAAG        1020                                                                           - AGAACTAAAC CTCTTCCCCC TGGTTTCCCC TCAACTGTCT GTGGCTGCAA AG - #GCAGAGGG        1080                                                                           - CAGGATCACC AGGGTGATGA CAAGTCCCAG CTTACAAGGA GGAAACTCAG GT - #CCAGAGAG        1140                                                                           - ATGGATTATC CCAAAGCCCC AAACATCCAG TTCTGCTGAA GAAGGCGGGT GG - #CAGGGGTG        1200                                                                           - GCACGTGGTG GGGGGAAGCC CAGGTCCTGC CTGCCTCTCA CCCTAATGTC AT - #CCTCACCC        1260                                                                           - TCTCTCTCCC CCCCACAGTG CTCAGGAGGT ACTGAGAAGT CCTGGCTGAC AA - #CCTCTGTG        1320                                                                           - TCCGCTTCTC CAACGCCCCT CCCCTGCTCC CCTTCAAAGC AACTCCTGTT TT - #TATTTATG        1380                                                                           - TATTTATTTA TTTATTTATT TGGTGGTTGT ATATAAGACG GTTCTTATTT GT - #GAGCACAT        1440                                                                           - TTTTTCCATG GTGAAATAAA GTCAACATTA GAGCTCTGTC TTTTGAAAAA AA - #AAAAAAAA        1500                                                                           #        1507                                                                  - (2) INFORMATION FOR SEQ ID NO:39:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 131 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:                                 - Met Gly Pro Arg Met Ala Leu Pro Arg Val Le - #u Leu Leu Leu Phe Leu          #                15                                                            - His Leu Leu Leu Leu Gly Cys Arg Ser His Pr - #o Leu Gly Gly Ala Gly          #            30                                                                - Leu Ala Ser Glu Leu Pro Gly Ile Gln Glu Le - #u Leu Asp Arg Leu Arg          #        45                                                                    - Asp Arg Val Ser Glu Leu Gln Ala Glu Arg Th - #r Asp Leu Glu Pro Leu          #    60                                                                        - Arg Gln Asp Arg Gly Leu Thr Glu Ala Trp Gl - #u Ala Arg Glu Ala Ala          #80                                                                            - Pro Thr Gly Val Leu Gly Pro Arg Ser Ser Il - #e Phe Gln Val Leu Arg          #                95                                                            - Gly Ile Arg Ser Pro Lys Thr Met Arg Asp Se - #r Gly Cys Phe Gly Arg          #           110                                                                - Arg Leu Asp Arg Ile Gly Ser Leu Ser Gly Le - #u Gly Cys Asn Val Leu          #       125                                                                    - Arg Arg Tyr                                                                      130                                                                        - (2) INFORMATION FOR SEQ ID NO:40:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 707 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 100...492                                                        (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:                                 - GAATTCCAGG CTGCTAGGAA GTGAAAAGTG AACCTGGACC CAGCTCAGCG GC - #AGCAGCAG          60                                                                           - CGGCAGCAGG CAGCAGCCTC TATCCTCTCC TCCAGCCACA TGGGCCCCCG GA - #TGGCGCTT         120                                                                           - CCCCGCGTGC TCCTGCTCCT GTTCTTGCAC CTGTTGCTGC TAGGATGCCG TT - #CCCATCCA         180                                                                           - CTGGGTGGCG CTGGCCTGGC CTCAGAACTG CCAGGGATAC AGGAGCTGCT GG - #ACCGCCTG         240                                                                           - CGAGACAGGG TCTCCGAGCT GCAGGCGGAG CGGACGGACC TGGAGCCCCT CC - #GGCAGGAC         300                                                                           - CGTGGCCTCA CAGAAGCCTG GGAGGCGAGG GAAGCAGCCC CCACGGGGGT TC - #TTGGGCCC         360                                                                           - CGCAGTAGCA TCTTCCAAGT CCTCCGGGGA ATACGCAGCC CCAAGACGAT GC - #GTGACTCT         420                                                                           - GGCTGCTTTG GGCGGAGGCT GGACCGGATC GGCTCCCTCA GCGGCCTGGG CT - #GCAATGTG         480                                                                           - CTCAGGAGGT ACTGAGAAGT CCTGGCTGAC AACCTCTGTG TCCGCTTCTC CA - #ACGCCCCT         540                                                                           - CCCCTGCTCC CCTTCAAAGC AACTCCTGTT TTTATTTATG TATTTATTTA TT - #TATTTATT         600                                                                           - TGGTGGTTGT ATATAAGACG GTTCTTATTT GTGAGCACAT TTTTTCCATG GT - #GAAATAAA         660                                                                           #               707TGTC TTTTGAAAAA AAAAAAAAAA GGAATTC                          - (2) INFORMATION FOR SEQ ID NO:41:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 131 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:                                 - Met Gly Pro Arg Met Ala Leu Pro Arg Val Le - #u Leu Leu Leu Phe Leu          #                15                                                            - His Leu Leu Leu Leu Gly Cys Arg Ser His Pr - #o Leu Gly Gly Ala Gly          #            30                                                                - Leu Ala Ser Glu Leu Pro Gly Ile Gln Glu Le - #u Leu Asp Arg Leu Arg          #        45                                                                    - Asp Arg Val Ser Glu Leu Gln Ala Glu Arg Th - #r Asp Leu Glu Pro Leu          #    60                                                                        - Arg Gln Asp Arg Gly Leu Thr Glu Ala Trp Gl - #u Ala Arg Glu Ala Ala          #80                                                                            - Pro Thr Gly Val Leu Gly Pro Arg Ser Ser Il - #e Phe Gln Val Leu Arg          #                95                                                            - Gly Ile Arg Ser Pro Lys Thr Met Arg Asp Se - #r Gly Cys Phe Gly Arg          #           110                                                                - Arg Leu Asp Arg Ile Gly Ser Leu Ser Gly Le - #u Gly Cys Asn Val Leu          #       125                                                                    - Arg Arg Tyr                                                                      130                                                                        - (2) INFORMATION FOR SEQ ID NO:42:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 1804 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 365...490                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 740...791                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 1558...1569                                                      (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:                                 - CGATCAGGGA TGTTGGGGCG GAGGAAACGG AGGGAAGGAG GGAGCGGAGG AG - #GCCCGAGG          60                                                                           - ACTGTTGGTG TCCCCCTCCT GCCCTTTTGG GGCCAGGCCC ACTTCTATAC AA - #GGCCTGCT         120                                                                           - CTCCAGCCTC CACCCCGGCG GGTATGGTGC AGGCGCGGAG GGGCGCATTC CC - #CCGCCCTG         180                                                                           - AGCTCAGCGG CCGGAATGCG GCCGATAAAT CAGAGATAAC CCCAGGCGCG GG - #ATAAGGGA         240                                                                           - TAAAAAGCCC CCGTTGCCGC GGGATCCAGG AGAGCACCCG CGCCCCAAGC GG - #TGACACTC         300                                                                           - GACCCCGGTC GCAGCGCAGC AGCTCAGCAG CCGGACGTCT CTTTCCCCAC TT - #CTCTCCAG         360                                                                           - CGACATGGAG CCCTGCGCAG CGCTGCCCCG GGCCCTCCTG CTCCTCCTGT TC - #TTGCACCT         420                                                                           - GTCGCCACTC GGAGGCCGCC CCCACCCGCT GGGCGGCCGC AGCCCCGCCT CG - #GAAGCCTC         480                                                                           - GGAAGCCTCA GAAGCCTCGG GGTTGTGGGC CGTGCAGGTG AGCGCTCAGC CT - #GCCTGAAG         540                                                                           - GCCGCGGCGG GTGGCAGCAG GTCACGGGGG CTTAGCCACT GTCCCAAGTC CT - #CAGTCTCC         600                                                                           - CTTGGGAATT AGTGATAAGG GAATCAGAAA GTGACGAGAT TGGGTGCCAG GA - #CTCCATAC         660                                                                           - CCAAGGCGGC GGCTTCACTT GGGTGCAAGG GTGGTTCCGC CCCGGCGTGG GT - #TCCTGAGG         720                                                                           - CTCAGGCCGT CCATTGCAGG AGCTGCTGGG CCGTCTGAAG GACGCAGTTT CA - #GAGCTGCA         780                                                                           - GGCAGAGCAG TTGGCCCTGG AACCCCTGCA CCGGAGCCAC AGCCCCGCAG AA - #GCCCCGGA         840                                                                           - GGCCGGAGGA ACGCCCCGTG GGGTCCTTGC ACCCCATGAC AGTGTCCTCC AG - #GCCCTGAG         900                                                                           - AAGACTACGC AGCCCCAAGA TGATGCACAA GTCAGGGTGC TTTGGCCGGA GG - #CTGGACCG         960                                                                           - GATCGGCTCC CTCAGTGGCC TGGGCTGCAA TGGTAAGCCG CCTCCCTGCC GC - #CTTGGCTC        1020                                                                           - CCCCTCCCCA GCCCCCTGGG TTCGACCCTT GGAACCCCTT CTGGGTTTGT TG - #TCTCGGGG        1080                                                                           - GATCACACTC TGAGGAAAGG ACATCTGGAC ATCGCTCCTT CTTGCTGACA GT - #CCTAAGGG        1140                                                                           - CCAAGGAGTA CGTTTCTGGA AATACTACGT GTGGACATCG TTGTCCAGGG TC - #CCTACCCA        1200                                                                           - CCTCCTAGCC CCCTCCTGCC TCTCGCACCC AAAGGGCAGA ATCATCTTAG GA - #TGGAATCA        1260                                                                           - GTCGTTGTCT GGAAGCATCT CCTTGGAGCA GAAAGAGTCC TAAACATCGT CC - #TCGTAGCT        1320                                                                           - CTCTCTGTCT GTCTGTAGCC ACGAAGGCAG AGGTCAGGGT CACCAGGGCA GT - #GATGATTC        1380                                                                           - CCAGTTAACA GAGGAGGAGA CTGAGGTCTA GAGAGATGGA TTATTCCAAA GC - #CTCAAACA        1440                                                                           - TCCAGATCGG CTGAGGGTGG GGTTGGTGGC AGGGATGGCT CCTGGGCTTG GG - #AAGCTCGG        1500                                                                           - ATCCTGCCTC AGTCTCCCAC CTGACGCCAT CATCCCCCTC TCTCTCCTCC CA - #CAGTGCTG        1560                                                                           - AGAAAGTATT AAGGAGGAAG TCCCGACTGC CCACATCTGC ATTGGATTCT TC - #AGCAGCCC        1620                                                                           - CTGAGCCCCT TGGAAGCAGA TCTTATTTAT TCGTATTTAT TTATTTATTT AT - #TTCGATTG        1680                                                                           - TTTTATATAA GATGATCCTG ACGCCCGAGC ACGGATTTTC CACGGTGAAA TA - #AAGTCAAC        1740                                                                           - CTTAGAGCTT CTTTTGAAAC CGATTTGTCC CTGTGCATTA AAAGTAACAC AT - #CATTTAAA        1800                                                                           #           1804                                                               - (2) INFORMATION FOR SEQ ID NO:43:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 131 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:                                 - Met Glu Pro Cys Ala Ala Leu Pro Arg Ala Le - #u Leu Leu Leu Leu Phe          #                15                                                            - Leu His Leu Ser Pro Leu Gly Gly Arg Pro Hi - #s Pro Leu Gly Gly Arg          #            30                                                                - Ser Pro Ala Ser Glu Ala Ser Glu Ala Ser Gl - #u Leu Leu Gly Arg Leu          #        45                                                                    - Lys Asp Ala Val Ser Glu Leu Gln Ala Glu Gl - #n Leu Ala Leu Glu Pro          #    60                                                                        - Leu His Arg Ser His Ser Pro Ala Glu Ala Pr - #o Glu Ala Gly Gly Thr          #80                                                                            - Pro Arg Gly Val Leu Ala Pro His Asp Ser Va - #l Leu Gln Ala Leu Arg          #                95                                                            - Arg Leu Arg Ser Pro Lys Met Met His Lys Se - #r Gly Cys Phe Gly Arg          #           110                                                                - Arg Leu Asp Arg Ile Gly Ser Leu Ser Gly Le - #u Gly Cys Asn Val Leu          #       125                                                                    - Arg Lys Tyr                                                                      130                                                                        - (2) INFORMATION FOR SEQ ID NO:44:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 1519 base                                                          (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (ix) FEATURE:                                                                      (A) NAME/KEY: CDS                                                              (B) LOCATION: 134...265                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 497...752                                                        (D) OTHER INFORMATION:                                                         (A) NAME/KEY: CDS                                                              (B) LOCATION: 1295...1308                                                      (D) OTHER INFORMATION:                                               -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:                                 - CCCACGGTGT CCCGAGGAGC CAGGAGGAGC ACCCCGCAGG CTGAGGGCAG GT - #GGGAAGCA          60                                                                           - AACCCGGACG CATCGCAGCA GCAGCAGCAG CAGCAGAAGC AGCAGCAGCA GC - #CTCCGCAG         120                                                                           - TCCCTCCAGA GACATGGATC CCCAGACAGC ACCTTCCCGG GCGCTCCTGC TC - #CTGCTCTT         180                                                                           - CTTGCATCTG GCTTTCCTGG GAGGTCGTTC CCACCCGCTG GGCAGCCCCG GT - #TCAGCCTC         240                                                                           - GGACTTGGAA ACGTCCGGGT TACAGGTGAG AGCGGAGGGC AGCTCAGGGG GA - #TTGGACAG         300                                                                           - CAGCAATGAA AGGGTCCTCA CCTGCTGTCC CAAGAGGCCC TCATCTTTCC TT - #TGGAATTA         360                                                                           - GTGATAAAGG AATCAGAAAA TGGAGAGACT GGGTGCCCTG ACCCTGTACC CA - #AGGCAGTC         420                                                                           - GGTTCACTTG GGTGCCATGA AGGGCTGGTG AGCCAGGGGT GGGTCCCTGA GG - #CTTGGACG         480                                                                           - CCCCCATTCA TTGCAGGAGC AGCGCAACCA TTTGCAGGGC AAACTGTCGG AG - #CTGCAGGT         540                                                                           - GGAGCAGACA TCCCTGGAGC CCCTCCAGGA GAGCCCCCGT CCCACAGGTG TC - #TGGAAGTC         600                                                                           - CCGGGAGGTA GCCACCGAGG GCATCCGTGG GCACCGCAAA ATGGTCCTCT AC - #ACCCTGCG         660                                                                           - GGCACCACGA AGCCCCAAGA TGGTGCAAGG GTCTGGCTGC TTTGGGAGGA AG - #ATGGACCG         720                                                                           - GATCAGCTCC TCCAGTGGCC TGGGCTGCAA AGGTAAGCAC CCCCTGCCAC CC - #CGGCCGCC         780                                                                           - TTCCCCCATT CCAGTGTGTG ACACTGTTAG AGTCACTTTG GGGTTTGTTG TC - #TCTGGGAA         840                                                                           - CCACACTCTT TGAGAAAAGG TCACCTGGAC ATCGCTTCCT CTTGTTAACA GC - #CTTCAGGG         900                                                                           - CCAAGGGGTG CCTTTGTGGA ATTAGTAAAT GTGGGCTTAT TTCATTACCA TG - #CCCACAAT         960                                                                           - ACCTTCTCCC CACCTCCTAC TTCTTATCAA AGGGGCAGAA TCTCCTTTGG GG - #GTCTGTTT        1020                                                                           - ATCATTTGGC AGCCCCCCAG TGGTGCAGAA AGAGAACCAA ACATTTCCTC CT - #GGTTTCCT        1080                                                                           - CTAAACTGTC TATAGTCTCA AAGGCAGAGA GCAGGATCAC CAGAGCAATG AT - #AATCCCCA        1140                                                                           - ATTTACAGAT GAGGAAACTG AGGCTCAGAG AGTTGCATTA AGCCTCAAAC GT - #CTGATGAC        1200                                                                           - TAACAGGGTG GTGGGTGGCA CACGATGAGG TAAGCTCAGC CCCTGCCTCC AT - #CTCCCACC        1260                                                                           - CTAACCATCA TCACCCTCTC TCTTTCCCTG ACAGTGCTGA GGCGGCATTA AG - #AGGAAGTC        1320                                                                           - CTGGCTGCAG ACACCTGCTT CTGATTCCAC AAGGGGCTTT TTCCTCAACC CT - #GTGGCCCT        1380                                                                           - CATCTTTCCT TTGGAATTAG TGATAAAGGA ATCAGAAAAT GGAGAGACTG GG - #TGCCCTGA        1440                                                                           - CCCTGTACCC AAGGCAGTCG GTTCACTTGG GTGCCATGAA GGGCCTGGTG AG - #CCAGGGGT        1500                                                                           #                 151 - #9                                                     - (2) INFORMATION FOR SEQ ID NO:45:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 134 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -      (v) FRAGMENT TYPE: internal                                             -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:                                 - Met Asp Pro Gln Thr Ala Pro Ser Arg Ala Le - #u Leu Leu Leu Leu Phe          #                15                                                            - Leu His Leu Ala Phe Leu Gly Gly Arg Ser Hi - #s Pro Leu Gly Ser Pro          #            30                                                                - Gly Ser Ala Ser Asp Leu Glu Thr Ser Gly Le - #u Gln Glu Gln Arg Asn          #        45                                                                    - His Leu Gln Gly Lys Leu Ser Glu Leu Gln Va - #l Glu Gln Thr Ser Leu          #    60                                                                        - Glu Pro Leu Gln Glu Ser Pro Arg Pro Thr Gl - #y Val Thr Lys Ser Arg          #80                                                                            - Glu Val Ala Thr Glu Gly Ile Arg Gly His Ar - #g Lys Met Val Leu Tyr          #                95                                                            - Thr Leu Arg Ala Pro Arg Ser Pro Lys Met Va - #l Gln Gly Ser Gly Cys          #           110                                                                - Phe Gly Arg Lys Met Asp Arg Ile Ser Ser Se - #r Ser Gly Leu Gly Cys          #       125                                                                    - Lys Val Leu Arg Arg His                                                          130                                                                        - (2) INFORMATION FOR SEQ ID NO:46:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 131 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:                                 - Met Gly Pro Arg Met Ala Leu Pro Arg Val Le - #u Leu Leu Leu Phe Leu          #                15                                                            - His Leu Leu Leu Leu Gly Cys Arg Ser His Pr - #o Leu Gly Gly Ala Gly          #            30                                                                - Leu Ala Ser Glu Leu Pro Gly Ile Gln Glu Le - #u Leu Asp Arg Leu Arg          #        45                                                                    - Asp Arg Val Ser Glu Leu Gln Ala Glu Arg Th - #r Asp Leu Glu Pro Leu          #    60                                                                        - Arg Gln Asp Arg Gly Leu Thr Glu Ala Trp Gl - #u Ala Arg Glu Ala Ala          #80                                                                            - Pro Thr Gly Val Leu Gly Pro Arg Ser Ser Il - #e Phe Gln Val Leu Arg          #                95                                                            - Gly Ile Arg Ser Pro Lys Thr Met Arg Asp Se - #r Gly Cys Phe Gly Arg          #           110                                                                - Arg Leu Asp Arg Ile Gly Ser Leu Ser Gly Le - #u Gly Cys Asn Val Leu          #       125                                                                    - Arg Arg Tyr                                                                      130                                                                        - (2) INFORMATION FOR SEQ ID NO:47:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 132 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:                                 - Met Glu Pro Cys Ala Ala Leu Pro Arg Ala Le - #u Leu Leu Leu Leu Phe          #                15                                                            - Leu His Leu Ser Pro Leu Gly Gly Arg Pro Hi - #s Pro Leu Gly Gly Arg          #            30                                                                - Ser Pro Ala Ser Glu Ala Ser Glu Ala Ser Gl - #u Leu Leu Gly Arg Leu          #        45                                                                    - Lys Asp Ala Val Ser Glu Leu Gln Ala Glu Gl - #n Leu Ala Leu Glu Pro          #    60                                                                        - Leu Arg His Arg Ser His Ser Pro Ala Ala Tr - #p Pro Ala Arg Gly Gly          #80                                                                            - Thr Pro Arg Gly Val Leu Ala Pro His Asp Se - #r Val Leu Gln Ala Leu          #                95                                                            - Arg Arg Leu Arg Ser Pro Lys Met Met His Ly - #s Ser Gly Cys Phe Gly          #           110                                                                - Arg Arg Leu Asp Arg Ile Gly Ser Leu Ser Gl - #y Leu Gly Cys Asn Val          #       125                                                                    - Leu Arg Lys Tyr                                                                  130                                                                        - (2) INFORMATION FOR SEQ ID NO:48:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 134 amino                                                          (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:                                 - Met Asp Pro Gln Thr Ala Pro Ser Arg Ala Le - #u Leu Leu Leu Leu Phe          #                15                                                            - Leu His Leu Ala Phe Leu Gly Gly Arg Ser Hi - #s Pro Leu Gly Ser Pro          #            30                                                                - Gly Ser Ala Ser Asp Leu Glu Thr Ser Gly Le - #u Gln Glu Gln Arg Asn          #        45                                                                    - His Leu Gln Gly Lys Leu Ser Glu Leu Gln Va - #l Glu Gln Thr Ser Leu          #    60                                                                        - Glu Pro Leu Gln Glu Ser Pro Arg Pro Thr Gl - #y Val Trp Lys Ser Arg          #80                                                                            - Glu Val Ala Thr Glu Gly Ile Arg Gly His Ar - #g Lys Met Val Leu Tyr          #                95                                                            - Thr Leu Arg Ala Pro Arg Ser Pro Lys Met Va - #l Gln Gly Ser Gly Cys          #           110                                                                - Phe Gly Arg Lys Met Asp Arg Ile Ser Ser Se - #r Ser Gly Leu Gly Cys          #       125                                                                    - Lys Val Leu Arg Arg His                                                          130                                                                        - (2) INFORMATION FOR SEQ ID NO:49:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 5 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:                                 - Asn Val Leu Arg Lys                                                           1               5                                                             - (2) INFORMATION FOR SEQ ID NO:50:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 6 amino                                                            (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:                                 - Asn Val Leu Arg Lys Tyr                                                       1               5                                                             __________________________________________________________________________ 

We claim:
 1. A peptide having natriuretic activity of the formula (SEQ ID NO: 15) R¹ -Cys-Phe-Gly-Arg-Arg-Leu-Asp-Arg-Ile-Gly-Ser-Leu-Ser-Gly-Leu-Gly-Cys-R.sup.2wherein R¹ (SEQ ID NOs:6-11) is selected from the group consisting of:(H); Gly-; Ser-Gly-; Lys-Ser-Gly-; His-Lys-Ser-Gly-; Met-His-Lys-Ser-Gly-; Thr-Met-His-Lys-Ser-Gly-; Lys-Thr-Met-His-Lys-Ser-Gly-; Pro-Lys-Thr-Met-His-Lys-Ser-Gly-; Ser-Pro-Lys-Thr-Met-His-Lys-Ser-Gly-; or is the amino acid sequence of the dog prepro sequence (SEQ ID NO: 47) upstream of position or a C-terminal portion thereof; R² (SEQ ID NOs: 12, 49-50) is (OH), NH₂, or NR₂ wherein each R is independently H or lower alkyl (1-4C) or isAsn; Asn-Val; Asn-Val-Leu; Asn-Val-Leu-Arg; Asn-Val-Leu-Arg-Lys; Asn-Val-Leu-Arg-Lys-Tyr; or the amides thereof.
 2. The peptide of claim 1 wherein R¹ is Ser-Pro-Lys-Thr-Met-His-Lys-Ser-Gly-(SEQ ID NO: 11) and wherein R² is Asn-Val-Leu-Arg-Lys-Tyr(SEQ ID NO: 50).
 3. A recombinant DNA in isolated form which comprises a nucleotide sequence that encodes the peptide of claim
 1. 4. A recombinant DNA in isolated form which comprises a nucleotide sequence that encodes the peptide of claim
 2. 5. A recombinant expression system capable, when contained in a recombinant host cell, of expressing a nucleotide sequence encoding the peptide of claim 1, said expression system comprising said encoding nucleotide sequence operably linked to control sequences to effect its expression.
 6. A recombinant expression system capable, when contained in a recombinant host cell, of expressing a nucleotide sequence encoding the peptide of claim 2, said expression system comprising said encoding nucleotide sequence operably linked to control sequences to effect its expression.
 7. A recombinant host cell or cell culture which has been manipulated to contain the expression system of claim
 6. 8. A recombinant host cell or cell culture which has been manipulated to contain the expression system of claim
 7. 9. A method to produce a peptide having natriuretic activity which method comprises culturing the cells of claim 8 under conditions which permit the expression of the nucleotide sequence encoding the peptide.
 10. A method to produce a peptide having natriuretic activity which method comprises culturing the cells of claim 8 under conditions which permit the expression of the nucleotide sequence encoding the peptide.
 11. A pharmaceutical or veterinary composition useful in treating conditions associated with a high level of extracellular fluid which composition comprises an effective amount of the peptide of claim 1 in admixture with a suitable excipient.
 12. A pharmaceutical or veterinary composition useful in treating conditions associated with a high level of extracellular fluid which composition comprises an effective amount of the peptide of claim 2 in admixture with a suitable excipient.
 13. A method to treat a condition characterized by a high level of extracellular fluid which comprises administering to a subject in need of such treatment an effective amount of the peptide of claim 1 or a formulation thereof.
 14. A method to treat a condition characterized by a high level of extracellular fluid which comprises administering to a subject in need of such treatment an effective amount of the peptide of claim 2 or a formulation thereof. 